Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Model Tree
Reset
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Quantizations
Merges
Inference Providers
Select all
fal
Cohere
Nscale
Fireworks
Novita
Together AI
Nebius AI Studio
Replicate
Cerebras
Featherless AI
Hyperbolic
SambaNova
HF Inference API
Misc
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
15
Full-text search
Edit filters
Sort: Trending
Active filters:
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Clear all
mradermacher/Llama-3.1-8B-UltraLong-2M-Instruct-GGUF
Updated
Apr 18
•
33
•
1
m-i/Llama-3.1-8B-UltraLong-2M-Instruct-mlx-8Bit
Text Generation
•
Updated
Apr 9
•
18
mradermacher/Llama-3.1-8B-UltraLong-2M-Instruct-i1-GGUF
Updated
Apr 18
•
68
•
1
lmstudio-community/Llama-3.1-8B-UltraLong-2M-Instruct-GGUF
Text Generation
•
Updated
Apr 14
•
23
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q4_K_S-GGUF
Updated
Apr 15
•
9
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF
Updated
Apr 15
•
5
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q5_K_S-GGUF
Updated
Apr 15
•
16
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q5_K_M-GGUF
Updated
Apr 15
•
18
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q6_K-GGUF
Updated
Apr 15
•
9
Triangle104/Llama-3.1-8B-UltraLong-2M-Instruct-Q8_0-GGUF
Updated
Apr 15
•
21
DevQuasar/nvidia.Llama-3.1-8B-UltraLong-2M-Instruct-GGUF
Text Generation
•
Updated
Apr 16
•
72
mradermacher/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-GGUF
Updated
Apr 17
•
39
•
1
mradermacher/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-i1-GGUF
Updated
Apr 17
•
134
•
1
itlwas/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF
Updated
Apr 19
•
1
Blasserman/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct-Q4_K_M-GGUF
Updated
Apr 27
•
2