Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nvidia
/
gpt3-8b-multi-3.5t-base
like
7
Follow
NVIDIA
4.98k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
gpt3-8b-multi-3.5t-base
1 contributor
History:
3 commits
rwaleffe
Update model arguments
51d7f04
6 months ago
release
Update model arguments
6 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
Safe
2.18 kB
Upload model
6 months ago
latest_checkpointed_iteration.txt
Safe
8 Bytes
Upload model
6 months ago
mt_nlg_plus_multilingual_ja_zh_the_stack_frac_015_256k.model
Safe
4.57 MB
LFS
Upload model
6 months ago