DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper โข 2501.12948 โข Published 8 days ago โข 270
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper โข 2501.11425 โข Published 10 days ago โข 84
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper โข 2501.10120 โข Published 13 days ago โข 40
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper โข 2501.09732 โข Published 14 days ago โข 66
Towards Best Practices for Open Datasets for LLM Training Paper โข 2501.08365 โข Published 16 days ago โข 51
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 16 days ago โข 271