Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
Yi Su
virtuoussy
AI & ML interests
None yet
Recent Activity
new activity
about 2 months ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR:如何使用
new activity
about 2 months ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR:Improve language tag
new activity
about 2 months ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR:Improve language tag
Organizations
None yet