Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nnheui
/
pythia-1.4b-dpo-full

Text Generation
Transformers
TensorBoard
Safetensors
gpt_neox
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
pythia-1.4b-dpo-full / runs
256 kB
  • 1 contributor
History: 19 commits
nnheui's picture
nnheui
Model save
c78836d verified over 1 year ago
  • Jul08_01-33-05_42dbe5cf9ed4
    Training in progress, step 500 over 1 year ago
  • Jul08_06-23-29_42dbe5cf9ed4
    Model save over 1 year ago
  • Mar16_17-07-37_42dbe5cf9ed4
    Model save over 1 year ago