Curation of resources used in the paper "Demystifying Long Chain-of-Thought Reasoning in LLMs"
-
Demystifying Long Chain-of-Thought Reasoning in LLMs
Paper • 2502.03373 • Published • 59 -
demystify-long-cot/math-train-qwq-rs-n256
Viewer • Updated • 1.14M • 32 • 1 -
demystify-long-cot/llama-3.1-8b-math-qwq-n256-rft
8B • Updated • 11 -
demystify-long-cot/math-train-qwq-rs-n192
Viewer • Updated • 854k • 32