Models (18)
ReasoningEval/huatuo_sft_m23k_grpo_qwen3-14b • 15B • Updated • 4
ReasoningEval/huatuo_sft_m23k_grpo_qwen3-8b • 8B • Updated • 3
ReasoningEval/huatuo_sft_m23k_grpo_llama31-8b • 8B • Updated • 2
ReasoningEval/openr1_sft_PRIME_grpo_qwen3-14b • 15B • Updated • 4
ReasoningEval/openr1_sft_PRIME_grpo_qwen3-8b • 8B • Updated • 4
ReasoningEval/openr1_sft_PRIME_grpo_llama31-8b • 8B • Updated • 4
ReasoningEval/openr1_sft_qwen3-8b • 8B • Updated • 4
ReasoningEval/openr1_sft_qwen3-14b • 425k • Updated • 3
ReasoningEval/openr1_sft_llama31-8b • 8B • Updated • 4
ReasoningEval/huatuo_sft_qwen3-8b • 8B • Updated • 5