Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
llavallava
/
qwen2.5-3b-instruct-trl-sft-lora-social_debug
like
0
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
sft
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
qwen2.5-3b-instruct-trl-sft-lora-social_debug
Commit History
Model save
b6870cb
verified
llavallava
commited on
Feb 4
Training in progress, step 243
775e7b2
verified
llavallava
commited on
Feb 4
Training in progress, step 240
9b38d87
verified
llavallava
commited on
Feb 4
Training in progress, step 220
584da04
verified
llavallava
commited on
Feb 4
Training in progress, step 200
841b814
verified
llavallava
commited on
Feb 4
Training in progress, step 180
003be75
verified
llavallava
commited on
Feb 3
Training in progress, step 160
6a9c1a2
verified
llavallava
commited on
Feb 3
Training in progress, step 140
fea0575
verified
llavallava
commited on
Feb 3
Training in progress, step 120
d19bde6
verified
llavallava
commited on
Feb 3
Training in progress, step 100
cfa3ca3
verified
llavallava
commited on
Feb 3
Training in progress, step 80
5245b84
verified
llavallava
commited on
Feb 3
Training in progress, step 60
63fb4dd
verified
llavallava
commited on
Feb 3
Training in progress, step 40
14d1bd1
verified
llavallava
commited on
Feb 3
Training in progress, step 20
d090b8a
verified
llavallava
commited on
Feb 3
initial commit
8adf8c1
verified
llavallava
commited on
Feb 3