luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskGlobal-1e-12_5894 Text Generation • 8B • Updated 27 days ago • 22
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskToken-1.0_4859 Text Generation • 8B • Updated 27 days ago • 18
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_2991 Text Generation • 8B • Updated 27 days ago • 20
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskToken-1.0_5849 Text Generation • 8B • Updated 27 days ago • 18
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_3401 Text Generation • 8B • Updated 27 days ago • 22
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-HessianMaskSentence-1e-3_4992 Text Generation • 8B • Updated 27 days ago • 5
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskSentence-1e-7-HessianMaskSentence-1e-6_6372 Text Generation • 8B • Updated 17 days ago • 14
luckeciano/Qwen-2.5-7B-GRPO-NoBaseline-FisherMaskSentence-1e-7-HessianMaskSentence-1e-5_1855 Text Generation • 8B • Updated 17 days ago • 13
cutelemonlili/Qwen2.5-Math-7B_Teacher_forget_RL_data_QwQ-Preview Text Generation • 8B • Updated 1 day ago • 2