khuang2/qwen-2.5-3b-r1-countdown-offline_query_gen_solvable_only Text Generation • 3B • Updated Feb 6 • 5
khuang2/qwen-2.5-3b-r1-countdown-offline_query_gen_solvable_only__train_query_gen-ckpt_175 Text Generation • 3B • Updated Feb 7 • 5
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_vdebug Text Generation • 3B • Updated Feb 7 • 8
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v8__steps_450__bs_56__lr_5e7__seed_42 Text Generation • 3B • Updated Feb 9 • 5
khuang2/qwen-2.5-3b-r1-countdown-train_query_and_policy_v9__steps_450__bs_56__lr_5e7__seed_42 Text Generation • 3B • Updated Feb 9 • 6
khuang2/qwen-2.5-3b-r1-countdown_v1__steps_450__bs_224__lr_5e7__seed_42 Text Generation • 3B • Updated Feb 8 • 4