Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
virtuoussy
's Collections
RLVR
RLVR
updated
15 days ago
Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
Upvote
11
+1
virtuoussy/Qwen2.5-7B-Instruct-RLVR
Updated
13 days ago
•
105
•
11
virtuoussy/Math-RLVR
Viewer
•
Updated
13 days ago
•
782k
•
240
•
6
virtuoussy/Multi-subject-RLVR
Viewer
•
Updated
13 days ago
•
579k
•
959
•
51
Upvote
11
+7
Share collection
View history
Collection guide
Browse collections