Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

meshalJcheema
/
Qwen2-0.5B-GRPO-test

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
grpo
Model card Files Files and versions
xet
Metrics Training metrics Community
Qwen2-0.5B-GRPO-test / runs
25.7 kB
  • 1 contributor
History: 11 commits
meshalJcheema's picture
meshalJcheema
Training in progress, step 3
0bcdcb0 verified 6 months ago
  • Apr15_10-58-52_d13786939d9e
    Training in progress, step 90 6 months ago
  • Apr15_12-24-44_3dd80a3c6041
    Training in progress, step 19 6 months ago
  • Apr16_16-20-57_56634c23df0e
    Training in progress, step 3 6 months ago