Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

tanliboy
/
lambda-llama-3-8b-dpo-test

Text Generation
Transformers
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Model card Files Files and versions Metrics Training metrics Community
lambda-llama-3-8b-dpo-test / runs /Sep18_05-34-48_action-graph-trainer
43.5 kB
  • 1 contributor
History: 2 commits
tanliboy's picture
tanliboy
End of training
a320a9a verified about 1 year ago
  • events.out.tfevents.1726638443.action-graph-trainer.2590949.0
    42.6 kB
    LFS
    Model save about 1 year ago
  • events.out.tfevents.1726645681.action-graph-trainer.2590949.1
    828 Bytes
    LFS
    End of training about 1 year ago