Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nvidia
/
Llama-3.1-Minitron-4B-Width-Base

Text Generation
Transformers
NeMo
Safetensors
PyTorch
English
llama
nvidia
llama-3
text-generation-inference
Model card Files Files and versions
xet
Community
14
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Update metadata

#14 opened 5 months ago by
pcuenq

size mismatch

πŸ‘€ 1
#12 opened 8 months ago by
Bleking

Are there any plans to publish a version of the model with only pruning and no distillation?

#11 opened 9 months ago by
kurogane

any chance of pruned 70b?

πŸ‘ 1
#10 opened 9 months ago by
pszemraj

Update config.json

1
#8 opened 9 months ago by
deshpandeabhi

i could load the model but not create an inference

#7 opened 9 months ago by
rijotomjackson

Weight Error in Notebook

8
#5 opened 10 months ago by
atharvanighot

Impact on effective context size ?

πŸ‘ 1
#3 opened 10 months ago by
BernardH

Is Instruct 4B-Width also going to be published?

πŸ‘ 2
2
#1 opened 10 months ago by
Qubitium
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs