Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
dada22231
/
6296264f-6ba2-49a8-99cc-8e195aa8cd5b
like
0
PEFT
Safetensors
qwen2
axolotl
Generated from Trainer
trl
grpo
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
6296264f-6ba2-49a8-99cc-8e195aa8cd5b
/
tokenizer_config.json
Commit History
Training in progress, step 100
5620ab7
verified
dada22231
commited on
6 days ago