Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
winnieyangwannan
/
Yi-6B-Chat_honest_lying_dpo_to_lie_lora_True
like
0
Transformers
Safetensors
Generated from Trainer
Yi-6B-Chat
honest_lying
dpo_to_lie
lora_True
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Yi-6B-Chat_honest_lying_dpo_to_lie_lora_True
Commit History
End of training
9a9db74
verified
winnieyangwannan
commited on
Feb 18
Model save
25cb3e9
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2760, checkpoint
754904f
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2760
338dd49
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2500, checkpoint
1068eb8
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2500
7b8e9f7
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2000, checkpoint
7a99997
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 2000
59a684a
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 1500, checkpoint
a7acce3
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 1500
b59f2db
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 1000, checkpoint
22901bc
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 1000
a57617f
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 500, checkpoint
39da246
verified
winnieyangwannan
commited on
Feb 18
Training in progress, step 500
f091a9f
verified
winnieyangwannan
commited on
Feb 18
End of training
28e05a4
verified
winnieyangwannan
commited on
Feb 18
initial commit
639b7fb
verified
winnieyangwannan
commited on
Feb 18