Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
NeelNanda
/
Attn-Only-2L512W-Shortformer-6B-big-lr
like
0
Transformers
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Attn-Only-2L512W-Shortformer-6B-big-lr
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
NeelNanda
Update config.json
438f8e1
almost 3 years ago
checkpoints
Auto Commit
almost 3 years ago
.gitattributes
Safe
1.38 kB
initial commit
almost 3 years ago
config.json
Safe
1.25 kB
Update config.json
almost 3 years ago
model_final.pth
Safe
219 MB
LFS
Auto Commit
almost 3 years ago
model_init.pth
Safe
219 MB
LFS
Auto Commit
almost 3 years ago
optimizer_state_dict.pth
Safe
433 MB
LFS
Auto Commit
almost 3 years ago
scheduler_state_dict.pth
Safe
751 Bytes
LFS
Auto Commit
almost 3 years ago