Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

NeelNanda
/
Attn-Only-2L512W-Shortformer-6B-big-lr

Transformers
Model card Files Files and versions Community
Attn-Only-2L512W-Shortformer-6B-big-lr
Ctrl+K
Ctrl+K
  • 1 contributor
History: 4 commits
NeelNanda's picture
NeelNanda
Update config.json
438f8e1 almost 3 years ago
  • checkpoints
    Auto Commit almost 3 years ago
  • .gitattributes
    1.38 kB
    initial commit almost 3 years ago
  • config.json
    1.25 kB
    Update config.json almost 3 years ago
  • model_final.pth
    219 MB
    LFS
    Auto Commit almost 3 years ago
  • model_init.pth
    219 MB
    LFS
    Auto Commit almost 3 years ago
  • optimizer_state_dict.pth
    433 MB
    LFS
    Auto Commit almost 3 years ago
  • scheduler_state_dict.pth
    751 Bytes
    LFS
    Auto Commit almost 3 years ago