Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

rkumar1999
/
Llama3.2-3B-Prover-openr1-distill-GRPO

Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
Llama3.2-3B-Prover-openr1-distill-GRPO / runs
43.1 kB
  • 1 contributor
History: 1 commit
rkumar1999's picture
rkumar1999
Training in progress, epoch 0
59ed3e8 verified 25 days ago
  • Oct06_18-36-09_a5810eaa
    Training in progress, epoch 0 25 days ago
  • Oct06_19-07-36_a5810eaa
    Training in progress, epoch 0 25 days ago
  • Oct06_19-26-27_a5810eaa
    Training in progress, epoch 0 25 days ago
  • Oct06_19-49-23_a5810eaa
    Training in progress, epoch 0 25 days ago