kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4 Text Generation • Updated Dec 7, 2024 • 14
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3 Text Generation • Updated Dec 7, 2024 • 12
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2 Text Generation • Updated Dec 7, 2024 • 12
kaiwenw/nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1 Text Generation • Updated Dec 7, 2024 • 13
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 73
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 121
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 83
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-with-sigmoid Viewer • Updated about 1 month ago • 123k • 60
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 64
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-24-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 62
kaiwenw/distill-r1-qwen-1.5b-aime-25-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 93
kaiwenw/distill-r1-qwen-1.5b-aime-24-4096-with-bt-model-wout-sigmoid Viewer • Updated about 1 month ago • 123k • 166
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_61440_69120 Viewer • Updated about 1 month ago • 7.68k • 52
kaiwenw/distill-r1-qwen-1.5b-hmmt-feb-25-4096-with-old-prm-indices_76800_84480 Viewer • Updated about 1 month ago • 7.68k • 53