Upload 8 files
- .gitattributes +7 -0
- README.md +100 -0
- Tofu-Q2_K.gguf +3 -0
- Tofu-Q3_K.gguf +3 -0
- Tofu-Q4_K.gguf +3 -0
- Tofu-Q4_K_S.gguf +3 -0
- Tofu-Q5_K.gguf +3 -0
- Tofu-Q6_K.gguf +3 -0
- Tofu-f16.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+Tofu-f16.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q3_K.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q4_K.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
+Tofu-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
README.md
ADDED
@@ -0,0 +1,100 @@
+---
+base_model:
+- jeiku/Rosa_v1_3B
+- jeiku/Theory_of_Mind_128_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Rosa_v1_3B
+- jeiku/PIPPA_128_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/LimaRP_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Theory_of_Mind_RP_128_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/No_Robots_Alpaca_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Alpaca_128_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Everything_v3_128_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/RPGPT_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Toxic_DPO_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Gnosis_256_StableLM
+- jeiku/Rosa_v1_3B
+- jeiku/Bluemoon_cleaned_StableLM
+tags:
+- mergekit
+- merge
+
+---
+# bigmix
+
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+## Merge Details
+### Merge Method
+
+This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) as a base.
+
+### Models Merged
+
+The following models were included in the merge:
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Theory_of_Mind_128_StableLM](https://huggingface.co/jeiku/Theory_of_Mind_128_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/PIPPA_128_StableLM](https://huggingface.co/jeiku/PIPPA_128_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/LimaRP_StableLM](https://huggingface.co/jeiku/LimaRP_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Theory_of_Mind_RP_128_StableLM](https://huggingface.co/jeiku/Theory_of_Mind_RP_128_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/No_Robots_Alpaca_StableLM](https://huggingface.co/jeiku/No_Robots_Alpaca_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Alpaca_128_StableLM](https://huggingface.co/jeiku/Alpaca_128_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Everything_v3_128_StableLM](https://huggingface.co/jeiku/Everything_v3_128_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/RPGPT_StableLM](https://huggingface.co/jeiku/RPGPT_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Toxic_DPO_StableLM](https://huggingface.co/jeiku/Toxic_DPO_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Gnosis_256_StableLM](https://huggingface.co/jeiku/Gnosis_256_StableLM)
+* [jeiku/Rosa_v1_3B](https://huggingface.co/jeiku/Rosa_v1_3B) + [jeiku/Bluemoon_cleaned_StableLM](https://huggingface.co/jeiku/Bluemoon_cleaned_StableLM)
+
+### Configuration
+
+The following YAML configuration was used to produce this model:
+
+```yaml
+merge_method: task_arithmetic
+base_model: jeiku/Rosa_v1_3B
+parameters:
+  normalize: true
+models:
+  - model: jeiku/Rosa_v1_3B+jeiku/No_Robots_Alpaca_StableLM
+    parameters:
+      weight: 0.5
+  - model: jeiku/Rosa_v1_3B+jeiku/Toxic_DPO_StableLM
+    parameters:
+      weight: 0.5
+  - model: jeiku/Rosa_v1_3B+jeiku/Alpaca_128_StableLM
+    parameters:
+      weight: 0.4
+  - model: jeiku/Rosa_v1_3B+jeiku/Everything_v3_128_StableLM
+    parameters:
+      weight: 0.4
+  - model: jeiku/Rosa_v1_3B+jeiku/Gnosis_256_StableLM
+    parameters:
+      weight: 1
+  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_128_StableLM
+    parameters:
+      weight: 0.8
+  - model: jeiku/Rosa_v1_3B+jeiku/PIPPA_128_StableLM
+    parameters:
+      weight: 0.4
+  - model: jeiku/Rosa_v1_3B+jeiku/LimaRP_StableLM
+    parameters:
+      weight: 0.7
+  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_RP_128_StableLM
+    parameters:
+      weight: 0.6
+  - model: jeiku/Rosa_v1_3B+jeiku/Bluemoon_cleaned_StableLM
+    parameters:
+      weight: 0.8
+  - model: jeiku/Rosa_v1_3B+jeiku/RPGPT_StableLM
+    parameters:
+      weight: 0.4
+dtype: float16
+
+```
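Note on the configuration above: task arithmetic adds weighted "task vectors" (fine-tuned weights minus base weights) back onto the base model. The sketch below illustrates that arithmetic on plain Python lists. It is not mergekit's code: the function name `task_arithmetic` is hypothetical, and it assumes that `normalize: true` divides the weighted sum of task vectors by the total weight. The `WEIGHTS` list copies the eleven weights from the YAML in the order they appear.

```python
def task_arithmetic(base, finetuned, weights, normalize=True):
    """Merge one parameter tensor, given as a flat list of floats.

    base      -- base model values
    finetuned -- one list of values per fine-tuned model
    weights   -- one scalar weight per fine-tuned model
    """
    total = sum(weights)
    merged = []
    for i, b in enumerate(base):
        # Weighted sum of task vectors (fine-tuned minus base) at position i.
        delta = sum(w * (ft[i] - b) for ft, w in zip(finetuned, weights))
        if normalize:
            # Assumption: normalization rescales by the sum of the weights.
            delta /= total
        merged.append(b + delta)
    return merged


# Weights from the configuration, in file order.
WEIGHTS = [0.5, 0.5, 0.4, 0.4, 1, 0.8, 0.4, 0.7, 0.6, 0.8, 0.4]

# Toy example with two fine-tuned "models" and weights 1 and 3.
toy = task_arithmetic([1.0, 2.0], [[2.0, 2.0], [1.0, 4.0]], [1.0, 3.0])
print(toy)
print(f"total weight in config: {sum(WEIGHTS):.1f}")
```

Under that assumption, only the ratios between weights matter: the eleven weights sum to 6.5, so the entry with `weight: 1` would contribute 1/6.5 of the combined task vector.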
Tofu-Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c83f8ff8f71e600a52a34c457acefcc6b3afb719ec699edcb72ba5e6609189cd
+size 1083755840
Tofu-Q3_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:51766581c0e7363f54860ea21f51db8c1de7c7267b8a9f3298d22d2f3d9b1993
+size 1391419200
Tofu-Q4_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bd41dad14ec27e557b183479c66f7885efe1f1bb57d46f0c2f930c8d25ff1d7a
+size 1708595520
Tofu-Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f57cc85a3de8fb139f8f89e5867a5ea092b8385c1f2c5fa144af7acaf28b372f
+size 1620695360
Tofu-Q5_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8f8901491b52f2c09c19d0c929a866201c6a19733a2c695bf183b24756ca753a
+size 1993390400
Tofu-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5b3a5c1a9e8367d160bbf33c4ddeace50199c8938256abf41eb5d59a9124b6c7
+size 2295984960
Tofu-f16.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5e21d55aece0f125a6663aa02117a8bb962c9f8ef792f4fb82d367bed2002c75
+size 5593341696
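Note on the `.gguf` entries: each one is a Git LFS pointer (spec version, SHA-256 object id, size in bytes), not the model file itself. A minimal sketch of reading such a pointer and checking a downloaded file against it follows. The helper names `parse_lfs_pointer` and `verify_download` are hypothetical, and the two pointer strings are copied from the Tofu-Q2_K.gguf and Tofu-f16.gguf hunks in this commit.

```python
import hashlib


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_download(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and SHA-256."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Stream in chunks so multi-gigabyte files are not loaded into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


Q2_K_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:c83f8ff8f71e600a52a34c457acefcc6b3afb719ec699edcb72ba5e6609189cd
size 1083755840
"""

F16_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5e21d55aece0f125a6663aa02117a8bb962c9f8ef792f4fb82d367bed2002c75
size 5593341696
"""

q2 = parse_lfs_pointer(Q2_K_POINTER)
f16 = parse_lfs_pointer(F16_POINTER)
ratio = q2["size"] / f16["size"]
print(f"Q2_K is {ratio:.1%} of the f16 file size")
```

The size ratio gives only a rough sense of how much each quantization shrinks the model, since the files also carry metadata and not every tensor is necessarily stored at the nominal precision.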