DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt Text Generation • 0.7B • Updated Dec 9, 2024 • 36
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft Text Generation • 0.7B • Updated Dec 9, 2024 • 27
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt Text Generation • 0.6B • Updated Dec 9, 2024 • 35
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft Text Generation • 0.6B • Updated Dec 9, 2024 • 32
DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt Text Generation • 0.5B • Updated Dec 7, 2024 • 24
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf Text Generation • 8B • Updated Dec 1, 2024 • 14 • 1
DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf Text Generation • 8B • Updated Dec 1, 2024 • 17
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_redistribution_hf Text Generation • 8B • Updated Nov 30, 2024 • 16
DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_error_only_hf Text Generation • 8B • Updated Nov 29, 2024 • 14