bartowski
/

Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation

Model card Files Files and versions

Resources

View closed (1)

Compatible small models for speculative decoding?

#9 opened 7 months ago by

How many GPU ram needed?

#8 opened 8 months ago by

q8 with 8 part

#7 opened 9 months ago by

Q6_K vs. Q5_K_L

#6 opened 9 months ago by

IQ2_S

#4 opened 9 months ago by

Unable to pull in from Ollama

#3 opened 9 months ago by

Observation: 4-bit quantization can't answer the Strawberry prompt

#2 opened 9 months ago by

Nemotron 51B too please

#1 opened 9 months ago by