81 33 214

Junyang Lin

JustinLin610

https://justinlin610.github.io

AI & ML interests

Pretraining, NLP, CV, etc.

Recent Activity

updated a model about 10 hours ago

Qwen/Qwen3-Embedding-8B-GGUF

updated a model about 10 hours ago

Qwen/Qwen3-Embedding-4B-GGUF

updated a model about 10 hours ago

Qwen/Qwen3-Embedding-8B

View all activity

Organizations

JustinLin610's activity

upvoted 7 collections about 10 hours ago

upvoted a paper 3 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 3 days ago • 122

upvoted a paper 18 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 22 days ago • 181

upvoted a paper 19 days ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 21 days ago • 78

upvoted 2 papers 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 156

upvoted a collection 3 months ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Apr 28 • 119

upvoted 2 papers 5 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

upvoted 3 papers 7 months ago

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published Nov 7, 2024 • 58

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 69

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 125

upvoted 2 collections 9 months ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 231

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Apr 28 • 616