-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 14 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 16 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 13 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 13
Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Organizations
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 12 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 11 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 12
RLAD
-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 14 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 16 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 13 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 13
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 12 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 11 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 12
models
391

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v1.1_orchard
8B
•
Updated
•
17

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_4096_with_reasoning_v2.1_orchard
8B
•
Updated
•
28

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_4096_with_reasoning_v2_orchard
8B
•
Updated
•
25

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard
8B
•
Updated
•
116

CohenQu/Qwen2.5-ARC-AGI-4-8-10_3x128_shuffled_tb_32_bs_512_minibs_32_microbs_16_n_16_tp_0.6
3B
•
Updated
•
20

CohenQu/Qwen3-4B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16
4B
•
Updated
•
15

CohenQu/Qwen3-1.7B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16
2B
•
Updated
•
22

CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-with_reasoning
Text Generation
•
1B
•
Updated
•
38

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_with_reasoning_fixed_DSAI
8B
•
Updated
•
31

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_without_reasoning_fixed_DSAI
8B
•
Updated
•
41
datasets
256
CohenQu/Omni-MATH-Qwen3-4B-16k_hard_human_prompts
Updated
•
15
CohenQu/Omni-MATH-human_easy-first-guide_verl
Updated
•
17
CohenQu/Omni-MATH-Qwen3-4B-16k_hard_human_sol
Updated
•
20
CohenQu/Omni-MATH-gemini_easy-mix-guide_easy-no-guide_verl
Updated
•
5
CohenQu/Omni-MATH-gemini_easy-first-guide_easy-no-guide_verl
Updated
•
5
CohenQu/Omni-MATH-gemini_easy-mix-guide_verl
Updated
•
5
CohenQu/Omni-MATH-gemini_easy-first-guide_verl
Updated
•
6
CohenQu/Omni-MATH-Qwen3-4B_hard_only-gemini-all-guide_verl
Viewer
•
Updated
•
2.16k
•
34
CohenQu/Omni-MATH-Qwen3-4B_hard_mix-gemini-all-guide_verl
Viewer
•
Updated
•
2.29k
•
33
CohenQu/Omni-MATH-Qwen3-4B_hard_no-gemini-guide_verl
Viewer
•
Updated
•
202
•
51