Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2505.16400

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63
o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 45
Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

Math and Code reasoning model trained through reinforcement learning (RL)

about 11 hours ago

nvidia/AceReason-Nemotron-14B

Text Generation • Updated 4 days ago • 44.2k • • 78
nvidia/AceReason-Nemotron-7B

Text Generation • Updated 4 days ago • 39k • • 11
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published 16 days ago • 30
nvidia/AceReason-Math

Viewer • Updated 4 days ago • 49.6k • 339 • 4

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published 22 days ago • 46
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published 16 days ago • 30

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published 23 days ago • 22
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published 24 days ago • 63
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published 22 days ago • 118
Scaling Reasoning can Improve Factuality in Large Language Models

Paper • 2505.11140 • Published 22 days ago • 6

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29 • 91
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published Apr 30 • 45
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model

Paper • 2211.11363 • Published Nov 21, 2022 • 1
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Paper • 2405.12130 • Published May 20, 2024 • 51

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs