CohenQu/gemini_easy-first-guide_verl_e3_tb_32_minibs_16_microbs_16_n_8_tp_0.8_cphigh_0.3 2B • Updated about 13 hours ago • 4
CohenQu/gemini_easy-first-guide_verl_1.7B_tb_32_minibs_16_microbs_16_n_8_tp_0.7_cphigh_0.3 2B • Updated 3 days ago • 11
CohenQu/gemini_easy-first-guide_verl_tb_32_minibs_16_microbs_16_n_8_tp_0.8_cphigh_0.3 4B • Updated 4 days ago • 15
CohenQu/gemini_easy-mix-guide_verl_tb_32_minibs_16_microbs_16_n_8_tp_0.8_cphigh_0.3 4B • Updated 4 days ago • 36
CohenQu/human_easy-first-guide_verl_v2_tb_32_minibs_16_microbs_16_n_8_tp_0.8_cphigh_0.3 4B • Updated 4 days ago • 24
CohenQu/gemini_easy-first-guide_verl_tb_32_minibs_16_microbs_16_n_8_tp_0.8_cphigh_0.35 4B • Updated 5 days ago • 32
CohenQu/Qwen3-1.7B_reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_2048_small_resize 2B • Updated 5 days ago • 20
CohenQu/Qwen3-1.7B_reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_2048_small 2B • Updated 5 days ago • 21
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v1.1_orchard 8B • Updated 11 days ago • 26
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_4096_with_reasoning_v2.1_orchard 8B • Updated 11 days ago • 31
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_4096_with_reasoning_v2_orchard 8B • Updated 11 days ago • 28
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard 8B • Updated 13 days ago • 144
CohenQu/Qwen2.5-ARC-AGI-4-8-10_3x128_shuffled_tb_32_bs_512_minibs_32_microbs_16_n_16_tp_0.6 3B • Updated 21 days ago • 27
CohenQu/Qwen3-1.7B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16 2B • Updated 27 days ago • 23
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-with_reasoning Text Generation • 1B • Updated 29 days ago • 39
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_with_reasoning_fixed_DSAI 8B • Updated Aug 19 • 4
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_without_reasoning_fixed_DSAI 8B • Updated Aug 19 • 13
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_256_minibs_16_microbs_16_n_16 2B • Updated Aug 18 • 5
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_512_minibs_16_microbs_16_n_32 2B • Updated Aug 18 • 5
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-without_reasoning Text Generation • 1B • Updated Aug 17 • 34
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_fixed_DSAI Feature Extraction • 8B • Updated Aug 17 • 10
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_DSAI Feature Extraction • 8B • Updated Aug 16 • 6
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_fixed_DSAI Feature Extraction • 8B • Updated Aug 16 • 7
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_DSAI Feature Extraction • 8B • Updated Aug 16 • 8
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_32_microbs_32_n_4 2B • Updated Aug 16 • 5
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_16_microbs_16_n_8 2B • Updated Aug 16 • 4
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-with_reasoning Text Generation • 1B • Updated Aug 16 • 90
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-without_reasoning Text Generation • 1B • Updated Aug 15 • 70