hxssgaa/llama-3-8b-dpo-full

Text Generation
Transformers
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
llama-3-8b-dpo-full / runs
  • 1 contributor
History: 9 commits
Latest commit: hxssgaa, "End of training", 0198ea9 (verified), 8 months ago
  • Oct07_16-40-24_a2ap-dgx001
    Training in progress, step 100 8 months ago
  • Oct07_16-47-37_a2ap-dgx001
    End of training 8 months ago
  • Oct08_11-20-18_a2ap-dgx018
    Training in progress, step 100 8 months ago
  • Oct08_11-25-31_a2ap-dgx018
    End of training 8 months ago