weege007/Qwen2.5-1.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 2B • Updated Jun 8 • 13
weege007/Qwen2.5-1.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 2B • Updated Jun 8 • 13
weege007/Qwen2.5-0.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 0.5B • Updated Jun 8 • 8
weege007/Qwen2.5-0.5B-Instruct_grpo_Countdown-Tasks-3to4 Text Generation • 0.5B • Updated Jun 8 • 8
Running 2.78k 2.78k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters