Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

bartowski
/
Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation
GGUF
English
nvidia
llama3.1
imatrix
conversational
Model card Files Files and versions
xet
Community
9
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Compatible small models for speculative decoding?

βž• 1
#9 opened 5 months ago by
treehugg3

How many GPU ram needed?

1
#8 opened 6 months ago by
RaidXD

q8 with 8 part

#7 opened 7 months ago by
sdyy

Q6_K vs. Q5_K_L

3
#6 opened 7 months ago by
AIGUYCONTENT

IQ2_S

#4 opened 7 months ago by
SekkSea

Unable to pull in from Ollama

5
#3 opened 7 months ago by
AIGUYCONTENT

Observation: 4-bit quantization can't answer the Strawberry prompt

πŸ‘€ 1
12
#2 opened 7 months ago by
ThePabli

Nemotron 51B too please

πŸ‘ 8
4
#1 opened 7 months ago by
nacs
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs