YeongminKim/zephyr-7b-dpo-full
Text Generation
Transformers
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
License: apache-2.0
Branch: main
Path: zephyr-7b-dpo-full/runs
1 contributor
History: 15 commits
Latest commit: End of training (f7ae256, verified), YeongminKim, 5 months ago
Name | Last commit message | Last updated
Apr18_19-31-32_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-10-45_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-13-32_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-30-11_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-35-34_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-50-00_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-52-04_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_20-57-30_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_21-00-53_aai-a1003 | Training in progress, step 100 | 5 months ago
Apr18_21-02-27_aai-a1003 | End of training | 5 months ago
Apr19_06-49-49_aai-a1003 | End of training | 5 months ago
Apr19_07-04-57_aai-a1003 | End of training | 5 months ago