cgus commited on
Commit
ac55a6a
·
verified ·
1 Parent(s): 17b8a86

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -1
README.md CHANGED
@@ -1,7 +1,25 @@
1
  ---
2
  license: mit
 
 
 
3
  ---
4
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  # Apriel-Nemotron-15b-Thinker
6
 
7
  <img src="https://cdn-uploads.huggingface.co/production/uploads/63d3095c2727d7888cbb54e2/Lt1t0tOO5emz1X23Azg-E.png" width="120" alt="thumbnail"/> `/ˈɑː.pri.əl/`
 
1
  ---
2
  license: mit
3
+ base_model:
4
+ - ServiceNow-AI/Apriel-Nemotron-15b-Thinker
5
+ library_name: exllamav2
6
  ---
7
+ # Apriel-Nemotron-15b-Thinker-exl2
8
+ Original model: [Apriel-Nemotron-15b-Thinker](https://huggingface.co/ServiceNow-AI/Apriel-Nemotron-15b-Thinker) by [ServiceNow-AI](https://huggingface.co/ServiceNow-AI)
9
+
10
+ ## Quants
11
+ [4bpw h6 (main)](https://huggingface.co/cgus/Apriel-Nemotron-15b-Thinker-exl2/tree/main)
12
+ [4.5bpw h6](https://huggingface.co/cgus/Apriel-Nemotron-15b-Thinker-exl2/tree/4.5bpw-h6)
13
+ [5bpw h6](https://huggingface.co/cgus/Apriel-Nemotron-15b-Thinker-exl2/tree/5bpw-h6)
14
+ [6bpw h6](https://huggingface.co/cgus/Apriel-Nemotron-15b-Thinker-exl2/tree/6bpw-h6)
15
+ [8bpw h8](https://huggingface.co/cgus/Apriel-Nemotron-15b-Thinker-exl2/tree/8bpw-h8)
16
+
17
+ ## Quantization notes
18
+ Made with Exllamav2 0.2.9 dev with default dataset.
19
+ It can be used with RTX GPU (Windows) or RTX/ROCm (Linux) with TabbyAPI or Text-Generation-WebUI.
20
+ It seems this model uses unusual thinking tags that aren't automatically recognized by frontends.
21
+
22
+ ## Original model card
23
  # Apriel-Nemotron-15b-Thinker
24
 
25
  <img src="https://cdn-uploads.huggingface.co/production/uploads/63d3095c2727d7888cbb54e2/Lt1t0tOO5emz1X23Azg-E.png" width="120" alt="thumbnail"/> `/ˈɑː.pri.əl/`