Qwen2.5-7B-Open-R1-GRPO / train_results.json
Yukang's picture
Model save
c6db779 verified
raw
history blame
205 Bytes
{
"total_flos": 0.0,
"train_loss": 2.3079139061880094e-05,
"train_runtime": 485.9244,
"train_samples": 93733,
"train_samples_per_second": 192.896,
"train_steps_per_second": 12.057
}