Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
hkust-nlp 's Collections
SimpleRL-Zoo
SimpleRL
PreSelect
CodeI/O
M-STAR
Deita
🎯DART-Math

SimpleRL

updated Feb 19

The collection for the Project "Simple Reinforcement Learning for Reasoning"

Upvote
7

  • hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero

    Updated Feb 23 • 59 • 3

  • hkust-nlp/Qwen-2.5-Math-7B-SimpleRL

    Updated Feb 23 • 9 • 4
Upvote
7
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs