Edit Models filters

Model Tree

nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

16

Full-text search

Active filters: nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct

mradermacher/Llama-3.1-8B-UltraLong-2M-Instruct-GGUF

8B • Updated Apr 18 • 13 • 1

m-i/Llama-3.1-8B-UltraLong-2M-Instruct-mlx-8Bit

Text Generation • 2B • Updated Apr 9 • 5

mradermacher/Llama-3.1-8B-UltraLong-2M-Instruct-i1-GGUF

8B • Updated Apr 18 • 50 • 1

lmstudio-community/Llama-3.1-8B-UltraLong-2M-Instruct-GGUF

Text Generation • 8B • Updated Apr 14 • 15

bartowski/nvidia_Llama-3.1-8B-UltraLong-2M-Instruct-GGUF

Text Generation • 8B • Updated Apr 14 • 1.33k • 1

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q4_K_S-GGUF

8B • Updated Apr 15 • 4

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF

8B • Updated Apr 15 • 5

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q5_K_S-GGUF

8B • Updated Apr 15 • 3

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q5_K_M-GGUF

8B • Updated Apr 15 • 1

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q6_K-GGUF

8B • Updated Apr 15 • 3

Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q8_0-GGUF

8B • Updated Apr 15 • 2

DevQuasar/nvidia.Llama-3.1-8B-UltraLong-2M-Instruct-GGUF

Text Generation • 8B • Updated Apr 16 • 7

mradermacher/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-GGUF

8B • Updated Apr 17 • 21 • 1

mradermacher/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-i1-GGUF

8B • Updated Apr 17 • 31 • 1

itlwas/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF

8B • Updated Apr 19 • 4

Blasserman/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF

8B • Updated Apr 27