CoREDumPSeGfault/Qwen2-0.5B-GRPO-4

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
grpo
Qwen2-0.5B-GRPO-4 / runs
99.4 kB
  • 1 contributor
History: 45 commits
Latest commit: a98e2eb (verified) by CoREDumPSeGfault, 6 months ago
    Training in progress, step 113
  • Apr28_15-22-08_1b39a6c4b4c4
    Training in progress, step 10 6 months ago
  • Apr29_15-44-56_cf86c107cc8f
    Training in progress, step 80 6 months ago
  • Apr29_17-18-26_f72a3401bd1a
    Training in progress, step 113 6 months ago
  • Apr29_17-41-10_f72a3401bd1a
    Training in progress, step 113 6 months ago
  • May01_05-01-22_178cea525788
    Training in progress, step 113 6 months ago