Upload README.md with huggingface_hub
README.md
ADDED
@@ -0,0 +1,56 @@
---
license: other
language:
- en
pipeline_tag: text-generation
inference: false
tags:
- transformers
- gguf
- imatrix
- Silicon-Maid-7B
---
Quantizations of https://huggingface.co/SanjiWatsuki/Silicon-Maid-7B

### Experiment

Quants **ending in "_X"** are experimental. They are the same as the normal quants, except that their token embedding weights are set to Q8_0 (F16 for Q6_K and Q8_0). This makes the experimental quants larger but should, in theory, improve performance.

List of experimental quants:
* Q2_K_X
* Q4_K_M_X
* Q5_K_M_X
* Q6_K_X
* Q8_0_X
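
The naming rule for the experimental quants can be written down as a small lookup. This is only an illustrative sketch: the function and constant names are made up here, and it encodes nothing beyond the rule stated in this section (it does not describe how the files were actually produced).

```python
from typing import Optional

# Base quants whose experimental variant stores token embeddings as F16,
# per the rule stated in this README. All other "_X" quants use Q8_0.
F16_EMBED_BASES = {"Q6_K", "Q8_0"}


def token_embedding_type(quant_name: str) -> Optional[str]:
    """Return the token-embedding type for an "_X" quant, or None otherwise.

    Hypothetical helper for illustration only.
    """
    if not quant_name.endswith("_X"):
        # Normal quant: the README does not state its embedding type.
        return None
    base = quant_name[: -len("_X")]
    return "F16" if base in F16_EMBED_BASES else "Q8_0"


if __name__ == "__main__":
    for name in ["Q2_K_X", "Q4_K_M_X", "Q5_K_M_X", "Q6_K_X", "Q8_0_X"]:
        print(f"{name}: token embeddings stored as {token_embedding_type(name)}")
```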

---

### Inference Clients/UIs
* [llama.cpp](https://github.com/ggerganov/llama.cpp)
* [JanAI](https://github.com/janhq/jan)
* [KoboldCPP](https://github.com/LostRuins/koboldcpp)
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
* [ollama](https://github.com/ollama/ollama)
* [exllamav2](https://github.com/turboderp/exllamav2)

---

# From original readme

Silicon-Maid-7B is another model that aims to be both strong at RP **and** a smart cookie that can follow character cards very well. As of right now, Silicon-Maid-7B outscores both of my previous 7B RP models in my RP benchmark, and I have been impressed by this model's creativity. It is suitable for RP/ERP and general use.

### Prompt Template (Alpaca)
I got the best SillyTavern results with the Noromaid template, but please try other templates! Let me know if you find anything good.

SillyTavern config files: [Context](https://files.catbox.moe/ifmhai.json), [Instruct](https://files.catbox.moe/ttw1l9.json).

Additionally, here is my highly recommended [Text Completion preset](https://huggingface.co/SanjiWatsuki/Loyal-Macaroni-Maid-7B/blob/main/Characters/MinP.json). You can tweak it by raising temperature or lowering min p to boost creativity, or by raising min p to increase stability. You shouldn't need to touch anything else!
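
To see why those two knobs pull in opposite directions, here is a rough sketch of min p sampling as it is commonly defined: a token survives only if its probability is at least `min_p` times the probability of the most likely token. The function name and the toy logits are made up for illustration, and applying temperature before the min p cutoff is an assumption; the actual order depends on the sampler settings of the client you use.

```python
import math


def min_p_filter(logits, temperature=1.0, min_p=0.1):
    """Return renormalized token probabilities after temperature and min p.

    Illustrative sketch only; assumes temperature is applied first.
    """
    # Temperature scaling: higher values flatten the distribution.
    scaled = [x / temperature for x in logits]
    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Min p: drop tokens below min_p times the top token's probability.
    cutoff = min_p * max(probs)
    kept = [p if p >= cutoff else 0.0 for p in probs]
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


def surviving_tokens(probs):
    """Count tokens that still have non-zero probability."""
    return sum(1 for p in probs if p > 0.0)


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.0, -3.0]  # made-up values
    for temperature, min_p in [(1.0, 0.1), (1.0, 0.5), (3.0, 0.1)]:
        probs = min_p_filter(toy_logits, temperature, min_p)
        print(temperature, min_p, surviving_tokens(probs))
```

Under these assumptions, raising min p shrinks the candidate pool (more stable output), while lowering it or raising temperature lets more low-probability tokens through (more varied output).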

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:
```
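
If you are calling the model from a script rather than a UI, the template above can be filled in with a small helper. This is a minimal sketch: `build_alpaca_prompt` is a hypothetical name, and ending the prompt with a newline after `### Response:` is an assumption rather than something the original readme specifies.

```python
# The Alpaca template from this README, with {prompt} as the only placeholder.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n"
    "{prompt}\n\n"
    "### Response:\n"  # trailing newline is an assumption
)


def build_alpaca_prompt(instruction: str) -> str:
    """Fill the Alpaca template with a single instruction (hypothetical helper)."""
    return ALPACA_TEMPLATE.format(prompt=instruction.strip())


if __name__ == "__main__":
    print(build_alpaca_prompt("Introduce yourself in one sentence."))
```

The resulting string is what you would pass as the raw prompt to whichever client you use; chat-style frontends such as SillyTavern apply their own template from the config files linked above.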