Darkhn committed on
Commit c33cd14 · verified · 1 Parent(s): 79be38c

Upload README.md

Files changed (1)
  1. README.md +79 -3
README.md CHANGED
@@ -1,3 +1,79 @@
- ---
- license: llama3.3
- ---
+ ---
+ base_model:
+ - nbeerbower/Llama-3.1-Nemotron-lorablated-70B
+ - ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3
+ - TheDrummer/Fallen-Llama-3.3-R1-70B-v1
+ - EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
+ - SicariusSicariiStuff/Negative_LLAMA_70B
+ - LatitudeGames/Wayfarer-Large-70B-Llama-3.3
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ license: llama3.3
+ ---
+
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64909c086073a0cd172d0411/KK8hCbd8KXy03cP_qzdij.png)
+
+ This is V2 of Dungeonmaster. I decided to move away from the R1 base here because I feel the pros don't necessarily outweigh the cons. For V2 I went with the classic nbeerbower/Llama-3.1-Nemotron-lorablated-70B as the base.
+ Dungeonmaster is meant specifically for creative roleplays with stakes and consequences, and it merges the following curated models:
+
+ - LatitudeGames/Wayfarer-Large-70B-Llama-3.3 - A fine-tuned model designed specifically for this very application.
+ - ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3 - Another fine-tune trained on RP datasets.
+ - EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1 - For its strong descriptive writing.
+ - SicariusSicariiStuff/Negative_LLAMA_70B - To assist with the darker undertones.
+ - TheDrummer/Fallen-Llama-3.3-R1-70B-v1 - The secret sauce, a completely unhinged thinking model that turns things up to 11.
+
+ # merge
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [Linear DELLA](https://arxiv.org/abs/2406.11617) merge method, with [nbeerbower/Llama-3.1-Nemotron-lorablated-70B](https://huggingface.co/nbeerbower/Llama-3.1-Nemotron-lorablated-70B) as the base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3](https://huggingface.co/ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3)
+ * [TheDrummer/Fallen-Llama-3.3-R1-70B-v1](https://huggingface.co/TheDrummer/Fallen-Llama-3.3-R1-70B-v1)
+ * [EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1](https://huggingface.co/EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1)
+ * [SicariusSicariiStuff/Negative_LLAMA_70B](https://huggingface.co/SicariusSicariiStuff/Negative_LLAMA_70B)
+ * [LatitudeGames/Wayfarer-Large-70B-Llama-3.3](https://huggingface.co/LatitudeGames/Wayfarer-Large-70B-Llama-3.3)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+   - model: LatitudeGames/Wayfarer-Large-70B-Llama-3.3
+     parameters:
+       weight: 0.20
+       density: 0.7
+   - model: ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3
+     parameters:
+       weight: 0.20
+       density: 0.7
+   - model: EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
+     parameters:
+       weight: 0.20
+       density: 0.7
+   - model: SicariusSicariiStuff/Negative_LLAMA_70B
+     parameters:
+       weight: 0.20
+       density: 0.7
+   - model: TheDrummer/Fallen-Llama-3.3-R1-70B-v1
+     parameters:
+       weight: 0.20
+       density: 0.7
+ merge_method: della_linear
+ base_model: nbeerbower/Llama-3.1-Nemotron-lorablated-70B
+ parameters:
+   epsilon: 0.2
+   lambda: 1.1
+ dtype: bfloat16
+ tokenizer:
+   source: nbeerbower/Llama-3.1-Nemotron-lorablated-70B
+ ```
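
The configuration above gives each of the five merged models the same weight (0.20) and density (0.7). A quick way to sanity-check those numbers is sketched below in Python. This is only an illustration: the `CONFIG` dict layout and the `check_config` helper are hypothetical and are not part of mergekit, and the expectation that the weights sum to 1.0 is an assumption made for this check, not something the source states as a requirement.

```python
# Values copied by hand from the YAML configuration above.
# The dict layout is illustrative only; it is NOT mergekit's internal format.
CONFIG = {
    "merge_method": "della_linear",
    "base_model": "nbeerbower/Llama-3.1-Nemotron-lorablated-70B",
    "models": [
        {"model": "LatitudeGames/Wayfarer-Large-70B-Llama-3.3", "weight": 0.20, "density": 0.7},
        {"model": "ArliAI/Llama-3.1-70B-ArliAI-RPMax-v1.3", "weight": 0.20, "density": 0.7},
        {"model": "EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1", "weight": 0.20, "density": 0.7},
        {"model": "SicariusSicariiStuff/Negative_LLAMA_70B", "weight": 0.20, "density": 0.7},
        {"model": "TheDrummer/Fallen-Llama-3.3-R1-70B-v1", "weight": 0.20, "density": 0.7},
    ],
    "parameters": {"epsilon": 0.2, "lambda": 1.1},
}


def check_config(config, tol=1e-9):
    """Hypothetical helper: return (total_weight, problems) for a config dict."""
    problems = []

    # Assumed convention for this sketch: per-model weights add up to 1.0.
    total = sum(entry["weight"] for entry in config["models"])
    if abs(total - 1.0) > tol:
        problems.append(f"weights sum to {total}, not 1.0")

    # Density is a fraction of parameters, so it should lie in (0, 1].
    for entry in config["models"]:
        if not 0.0 < entry["density"] <= 1.0:
            problems.append(f"{entry['model']}: density {entry['density']} out of range")

    # The base model should not also appear in the list of merged models.
    merged_names = [entry["model"] for entry in config["models"]]
    if config["base_model"] in merged_names:
        problems.append("base_model is also listed under models")

    return total, problems


if __name__ == "__main__":
    total, problems = check_config(CONFIG)
    print(f"models merged: {len(CONFIG['models'])}")
    print(f"total weight: {total:.2f}")
    print("problems:", problems if problems else "none")
```

To reproduce the merge itself, the YAML would normally be saved to a file and passed to mergekit's `mergekit-yaml` command along with an output directory; the check above only inspects the numbers and does not touch any model weights.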