Dai
Yinpei
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 20 hours ago
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM Reasoning
published
a model
21 days ago
Yinpei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
published
a model
21 days ago
Yinpei/Qwen2.5-1.5B-Open-R1-Distill
Organizations
Collections
1
models
9
Yinpei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
Yinpei/Qwen2.5-1.5B-Open-R1-Distill
Updated
Yinpei/runs_ckpt
Updated
Yinpei/real_h5dy
Updated
Yinpei/racer-visuomotor-policy-simple
Updated
Yinpei/racer-visuomotor-policy-rich
Updated
Yinpei/racer-llava-llama3-lora-simple
Updated
•
1
Yinpei/racer-llava-llama3-lora-rich-betterswitch
Updated
•
2
Yinpei/racer-llava-llama3-lora-rich
Updated
•
1