Math
GitBag/gemma-2-9b-it-gsm8k • Updated Nov 5, 2024 • 7.47k • 54
GitBag/llama-3_1-70b-it-gsm8k • Updated Nov 5, 2024 • 7.47k • 39
GitBag/gemma-2-27b-it-gsm8k • Updated Nov 5, 2024 • 7.47k • 44
GitBag/llama-3-8b-it-gsm8k • Updated Nov 5, 2024 • 7.47k • 37
GitBag/a_star_final_a_star_math_1.5_random_reward_actor Text Generation • 2B • Updated 19 days ago • 27
GitBag/a_star_final_a_star_math_1.5_wrong_reward_actor Text Generation • 2B • Updated 19 days ago • 24
GitBag/a_star_final_a_star_math_3_random_reward_actor Text Generation • 3B • Updated 19 days ago • 25
GitBag/a_star_final_a_star_math_7_random_reward_actor Text Generation • 8B • Updated 19 days ago • 62
GitBag/a_star_final_ds-distilled-qwen-1.5b-a-star-16384_actor Text Generation • 2B • Updated May 27 • 9
GitBag/a_star_final_ds-distilled-qwen-1.5b-grpo-2-kl-1e-4-16384_actor Text Generation • 2B • Updated May 27 • 26
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • 2B • Updated May 12 • 6
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • 2B • Updated May 12 • 40