Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nnheui
/
pythia-1.4b-dpo-full

Text Generation
Transformers
TensorBoard
Safetensors
gpt_neox
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
pythia-1.4b-dpo-full / runs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 70 commits
nnheui's picture
nnheui
End of training
1a33399 verified about 1 year ago
  • Jul08_01-33-05_42dbe5cf9ed4
    Training in progress, step 500 about 1 year ago
  • Jul08_06-23-29_42dbe5cf9ed4
    End of training about 1 year ago
  • Jul08_11-47-26_42dbe5cf9ed4
    End of training about 1 year ago
  • Jul08_12-10-46_42dbe5cf9ed4
    End of training about 1 year ago
  • Jul08_16-05-34_42dbe5cf9ed4
    End of training about 1 year ago
  • Mar16_17-07-37_42dbe5cf9ed4
    Model save over 1 year ago