Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
gpt3-8b-multi-3.5t-base
like
8
Follow
NVIDIA
22.4k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
gpt3-8b-multi-3.5t-base
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
rwaleffe
Update model arguments
51d7f04
10 months ago
release
.gitattributes
Safe
1.52 kB
README.md
Safe
2.18 kB
latest_checkpointed_iteration.txt
Safe
8 Bytes
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model
Safe
4.57 MB
LFS