nvidia/Llama-3.1-Minitron-4B-Width-Base

Text Generation

text-generation-inference

Update metadata

#14 opened 5 months ago by

Size mismatch

#12 opened 8 months ago by

Are there any plans to publish a version of the model with only pruning and no distillation?

#11 opened 9 months ago by

Any chance of a pruned 70B?

#10 opened 9 months ago by

Update config.json

#8 opened 9 months ago by

I could load the model but could not run inference

#7 opened 9 months ago by

Weight Error in Notebook

#5 opened 10 months ago by

Impact on effective context size?

#3 opened 10 months ago by

Is Instruct 4B-Width also going to be published?

#1 opened 10 months ago by