daqc's Collections

Reasoning LLMs

updated Jan 27

  • HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

    Paper • 2412.18925 • Published Dec 25, 2024 • 103

  • Search-o1: Agentic Search-Enhanced Large Reasoning Models

    Paper • 2501.05366 • Published Jan 9 • 101

  • rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

    Paper • 2501.04519 • Published Jan 8 • 277

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22 • 391