Running 535 535 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
UNIVA-Bllossom/DeepSeek-llama3.3-Bllossom-70B Text Generation • Updated 27 days ago • 3.46k • 52
UNIVA-Bllossom/DeepSeek-llama3.1-Bllossom-8B Text Generation • Updated 29 days ago • 7.85k • 38
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 346