Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,8 @@ tags:
|
|
14 |
- pytorch
|
15 |
---
|
16 |
|
|
|
|
|
17 |
# Llama-3.1-Nemotron-Ultra-253B-v1
|
18 |
|
19 |
## Model Overview
|
|
|
14 |
- pytorch
|
15 |
---
|
16 |
|
17 |
+
EXL3 quant for 3.6BPW. It fits into 128GB, abeit with very limited context.
|
18 |
+
|
19 |
# Llama-3.1-Nemotron-Ultra-253B-v1
|
20 |
|
21 |
## Model Overview
|