hkust-nlp's Collections
M-STAR
Updated 23 days ago
Resources of M-STAR (Multimodal Self-Evolving Training for Reasoning) https://mstar-lmm.github.io/
hkust-nlp/mstar-8b-v1.0 • Updated Dec 25, 2024 • 6 • 2
hkust-nlp/mstar-prm-8b-v1.0 • Updated Dec 25, 2024 • 13 • 2