Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
nvidia
/
gpt3-8b-multi-3.5t-base
like
8
Follow
NVIDIA
39.7k
Text Generation
English
Megatron-LM
nvidia
Mamba
Mamba-2
SSM
8B
arxiv:
2406.07887
arxiv:
2405.21060
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
890e55a
gpt3-8b-multi-3.5t-base
/
release
/
mp_rank_00
17.1 GB
1 contributor
History:
1 commit
rwaleffe
Upload model
890e55a
over 1 year ago
model_optim_rng.pt
17.1 GB
xet
Upload model
over 1 year ago