6 12 42

Dacheng Li

DachengLi

https://dachengli1.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

liked a model 8 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

liked a dataset 12 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

View all activity

Organizations

DachengLi's activity

upvoted a paper 5 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 11 days ago • 118

liked a model 8 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • Updated 6 days ago • 4.54k • 144

liked a dataset 12 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 11.3k • 500

liked a model 12 days ago

unsloth/DeepSeek-R1-0528-GGUF

Text Generation • Updated about 1 hour ago • 91.1k • 144

liked a dataset 15 days ago

open-r1/Mixture-of-Thoughts

Viewer • Updated 16 days ago • 699k • 31.3k • 209

updated a dataset 26 days ago

Efficient-Large-Model/worldmodelbench

Updated 26 days ago • 160

upvoted a paper about 2 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 61

liked 2 models 2 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated Jan 12 • 372k • • 485

all-hands/openhands-lm-32b-v0.1

Text Generation • Updated Apr 16 • 215k • • 381

upvoted a paper 2 months ago

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published Mar 12 • 39

updated a dataset 3 months ago

DachengLi/d1k

Viewer • Updated Mar 26 • 1k • 35

published a dataset 3 months ago

DachengLi/d1k

Viewer • Updated Mar 26 • 1k • 35

authored a paper 3 months ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 41

upvoted a collection 3 months ago

NovaSky Papers

Collection

2 items • Updated Feb 21 • 3

commented a paper 4 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63 •

authored a paper 4 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

upvoted a paper 4 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

liked a dataset 5 months ago

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated Feb 19 • 483k • 468 • 38

liked 2 datasets 6 months ago

AlexCuadron/SWE-Bench-Verified-O1-reasoning-high-results

Viewer • Updated Dec 29, 2024 • 495 • 5.94k • 5

codeparrot/apps

Updated Oct 20, 2022 • 4.13k • 174