Models (22)
yifeng2025summer/real-dapo-code_reason-qwen2_5-7b-ipython-force-valid-action-3turn-step_13 • 8B • Updated • 3
yifeng2025summer/real-dapo-code_reason-qwen2_5-7b-ipython-force-valid-action-3turn-step_7 • 8B • Updated • 3
yifeng2025summer/grpo-code_reason-llama3_2-3b-ipython-force-valid-action-3turn-sp2-step_17 • 4B • Updated • 5
yifeng2025summer/gtpo-code_reason-llama3-2-3b-ipython-force-valid-action-3turn-step_23 • 4B • Updated • 4
yifeng2025summer/code_reason-qwen2.5-7b-cold-start-retool-sft-epoch-3 • 8B • Updated • 6
yifeng2025summer/qwen2.5_32b_sft_epoch3 • 1.12M • Updated • 3
yifeng2025summer/qwen2.5_14b_sft_epoch3 • 15B • Updated • 4
yifeng2025summer/qwen2.5_3b_sft_epoch3 • 3B • Updated • 4
yifeng2025summer/qwen2.5_1.5b_sft_epoch3 • 2B • Updated • 7
yifeng2025summer/qwen2.5_7b_gtpo_difflib_step30 • 8B • Updated • 5