LLM trained via RL for reasoning tasks.
Peiyan He
CaptainHPY
AI & ML interests
None yet
Recent Activity
updated
a model
22 days ago
CaptainHPY/Qwen2.5-7B-R1-GGUF
liked
a dataset
22 days ago
open-r1/DAPO-Math-17k-Processed
liked
a dataset
22 days ago
unsloth/OpenMathReasoning-mini
Organizations
None yet