Delta-Vector commited on
Commit
c97777b
·
verified ·
1 Parent(s): e5d4882

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +412 -20
README.md CHANGED
@@ -1,36 +1,428 @@
1
  ---
 
 
 
 
 
 
 
 
 
2
  base_model:
3
- - NewEden/Hamanasu-KTO-V2
4
- - NewEden/adventure-2-2025
5
- library_name: transformers
6
  tags:
7
- - mergekit
8
- - merge
9
-
 
10
  ---
11
- # merge
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
 
13
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
14
 
15
- ## Merge Details
16
- ### Merge Method
17
 
18
- This model was merged using the Passthrough merge method using [NewEden/Hamanasu-KTO-V2](https://huggingface.co/NewEden/Hamanasu-KTO-V2) + [NewEden/adventure-2-2025](https://huggingface.co/NewEden/adventure-2-2025) as a base.
19
 
20
- ### Models Merged
 
 
 
 
21
 
22
- The following models were included in the merge:
23
 
24
 
25
- ### Configuration
26
 
27
- The following YAML configuration was used to produce this model:
28
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  ```yaml
30
- base_model: NewEden/Hamanasu-KTO-V2+NewEden/adventure-2-2025
31
- dtype: bfloat16
32
- merge_method: passthrough
33
- models:
34
- - model: NewEden/Hamanasu-KTO-V2+NewEden/adventure-2-2025
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg"
3
+ datasets:
4
+ - PocketDoc/Dans-Prosemaxx-Adventure
5
+ - PocketDoc/Dans-Failuremaxx-Adventure-3
6
+ - PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-2
7
+ - PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-3
8
+ - PocketDoc/Dans-Prosemaxx-InstructWriter-Continue-2
9
+ - PocketDoc/Dans-Prosemaxx-Instructwriter-Long
10
+ - PocketDoc/Dans-Personamaxx-VN
11
  base_model:
12
+ - Delta-Vector/Hamanasu-4B-Instruct-KTO-V2
 
 
13
  tags:
14
+ - llama
15
+ - roleplay
16
+ - finetune
17
+ - storywriting
18
  ---
19
+ <!DOCTYPE html>
20
+ <style>
21
+ html, body {
22
+ background: black;
23
+ color: #c9d1d9 !important;
24
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
25
+ margin: 0;
26
+ padding: 0;
27
+ min-height: 100vh;
28
+ }
29
+ .markdown-body {
30
+ color: white;
31
+ margin: 40px auto;
32
+ padding: 40px;
33
+ border-radius: 12px;
34
+ position: relative;
35
+ overflow: hidden;
36
+ }
37
+
38
+ .markdown-body::after {
39
+ content: '';
40
+ position: absolute;
41
+ top: 0;
42
+ left: 0;
43
+ width: 100%;
44
+ height: 100%;
45
+ background: #0c0f18; /* background color */
46
+ pointer-events: none;
47
+ z-index: -999;
48
+ }
49
+
50
+ h1, h2, h3 {
51
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
52
+ -webkit-background-clip: text;
53
+ -webkit-text-fill-color: transparent;
54
+ border-bottom: 1px solid #333;
55
+ padding-bottom: 0.3em;
56
+ }
57
+
58
+ div[style*="border:2px solid #333"],
59
+ div[style*="border: 2px solid #333"],
60
+ div[style*="border:1px solid #333"],
61
+ div[style*="border: 1px solid #333"] {
62
+ background: rgba(22, 27, 34, 0.8) !important;
63
+ border: 2px solid #6e00ff !important;
64
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
65
+ border-radius: 10px;
66
+ padding: 20px;
67
+ margin: 20px 0;
68
+ }
69
+
70
+ code {
71
+ background-color: #1a1a1a !important;
72
+ border-radius: 4px;
73
+ padding: 0.2em 0.4em;
74
+ color: #00ffff;
75
+ }
76
+
77
+ pre {
78
+ background-color: #1a1a1a !important;
79
+ border: 1px solid #333;
80
+ border-radius: 8px;
81
+ padding: 16px;
82
+ }
83
+
84
+ table {
85
+ width: 100%;
86
+ border-collapse: collapse;
87
+ margin: 20px 0;
88
+ background: rgba(0,0,0,0.2);
89
+ table-layout: fixed;
90
+ color: white;
91
+ }
92
+
93
+ th, td {
94
+ border: 1px solid #333;
95
+ padding: 12px;
96
+ text-align: center;
97
+ color: white;
98
+ }
99
+
100
+ th {
101
+ background: rgba(110, 0, 255, 0.1);
102
+ }
103
+
104
+ td:nth-child(1) {
105
+ width: 1%;
106
+ white-space: nowrap;
107
+ }
108
+
109
+ td:nth-child(2) {
110
+ width: 100%;
111
+ }
112
+
113
+ td > span {
114
+ display: block;
115
+ padding: 4px 8px;
116
+ background: rgba(110, 0, 255, 0.1);
117
+ border-radius: 4px;
118
+ transition: all 0.3s ease;
119
+ }
120
+
121
+ td > span:hover {
122
+ background: rgba(110, 0, 255, 0.2);
123
+ transform: translateY(-1px);
124
+ }
125
+
126
+ a {
127
+ color: #00ffff;
128
+ text-decoration: none;
129
+ transition: all 0.3s ease;
130
+ }
131
+
132
+ a:hover {
133
+ color: #6e00ff;
134
+ text-decoration: none;
135
+ }
136
+
137
+ hr {
138
+ border: 0;
139
+ height: 1px;
140
+ background: linear-gradient(90deg, transparent, #333, transparent);
141
+ margin: 40px 0;
142
+ }
143
+
144
+ img {
145
+ max-width: 100%;
146
+ border-radius: 10px;
147
+ }
148
+
149
+ details summary:hover {
150
+ color: #00ffff;
151
+ }
152
+
153
+ * {
154
+ color-scheme: dark !important;
155
+ }
156
+
157
+ .prose, .max-w-none, .px-4 {
158
+ background-color: transparent !important;
159
+ color: #c9d1d9 !important;
160
+ }
161
+ </style>
162
+ <body>
163
+ <div class="markdown-body">
164
+ <div align="center">
165
+
166
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
167
+
168
+ <br>
169
+ <br>
170
+
171
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
172
+ Hamanasu 4B
173
+ </div>
174
+
175
+ </div>
176
+
177
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
178
+
179
+
180
+ ## 🌌 Overview
181
+
182
+ <i>Thanks to PocketDoc's Adventure datasets and taking his Dangerous Winds models as inspiration, I was able to finetune a small Adventure model that ***HATES*** the User</i>
183
+
184
+ <i>The model is suited for Text Adventure, All thanks to Tav for funding the train.</i>
185
+
186
+ <i>Support me and my finetunes on Ko-Fi https://ko-fi.com/deltavector</i>
187
+
188
+
189
+ </div>
190
 
191
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
192
 
 
 
193
 
194
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
195
 
196
+ ### ⚔️ Hardware
197
+ - 8x H100s
198
+ - Epochs: 4
199
+ - Base: `Delta-Vector/Hamanasu-4B-Instruct-KTO-V2`
200
+ </div>
201
 
202
+ </div>
203
 
204
 
205
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
206
 
207
+ ## 💰 Prompting
208
 
209
+
210
+ <i>This model uses ChatML formatting(Names off)</i>
211
+ ```python
212
+ <|im_start|>system
213
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
214
+ <|im_start|>User request
215
+ Take off your helmet.<|im_end|>
216
+ <|im_start|>No i shall not. This is the way.
217
+ ```
218
+ </div>
219
+
220
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
221
+
222
+ ## 🎲 Recommended Sampler Preset
223
+
224
+ ```yml
225
+ ST preset: https://files.catbox.moe/1m29if.json
226
+ System prompt: (Thanks Trappu)
227
+ You are an expert fiction writer and gamemaster.
228
+
229
+ Always keep the following rules in mind:
230
+ - Always follow the "show, don't tell" principle.
231
+ - ">" means a Player's action. Describe what happens when the Player does that action.
232
+ - Allow the Player to struggle or fail when attempting actions.
233
+ - Describe characters in vivid detail including their physical appearance, clothing, facial expression, and actions.
234
+ - Describe locations in detail upon arriving or entering them.
235
+ - Regularly introduce new twists, events, and characters to entertain the Player.
236
+ - Create robust characters with differing personalities, motives, and interesting physical characteristics/styles.
237
+ - Convey emotion via behavior, facial expressions, and body language.
238
+ - Convey events and story through dialogue.
239
+ - Give the player ways to interact with the world to make meaningful changes.
240
+ - Mix short, punchy sentences with long, descriptive ones. Drop fill words to add variety.
241
+ - Each situation should present the Player with interesting choices. Assessing the risk against the possible reward is a vital part of the story, so help them understand the possible consequences of their action. Remember, easy choices make for a boring story.
242
+ - Whatever the Player's action, the story will advance. Ensure every choice has an impact on the current and ongoing situation.
243
+ - Only describe the Player's actions based on what the Player inputs after the ">" symbol.
244
+ - Write only the prompted content
245
+ - Stay within the current moment of the story
246
+ - Maintain strict adherence to provided instructions
247
+ - Create vivid, specific descriptions using strong nouns and verbs
248
+ - Present direct, concrete action and dialogue
249
+ - End scenes with clear, distinct actions or dialogue that serve the current moment
250
+ ```
251
+ </div>
252
+
253
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
254
+
255
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
256
+
257
+ <details>
258
+
259
  ```yaml
260
+ base_model: NewEden/Hamanasu-KTO-V2
261
+ model_type: AutoModelForCausalLM
262
+ tokenizer_type: AutoTokenizer
263
+
264
+ hub_model_id: NewEden/Hamanasu-4B-Adventure-V2
265
+ hub_strategy: "end"
266
+ push_dataset_to_hub:
267
+ hf_use_auth_token: true
268
+
269
+ plugins:
270
+ - axolotl.integrations.liger.LigerPlugin
271
+ - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
272
+ liger_rope: true
273
+ liger_rms_norm: true
274
+ liger_layer_norm: true
275
+ liger_glu_activation: true
276
+ liger_fused_linear_cross_entropy: false
277
+ cut_cross_entropy: true
278
+
279
+
280
+ load_in_8bit: false
281
+ load_in_4bit: false
282
+ strict: false
283
+
284
+ datasets:
285
+ - path: ./t.json
286
+ type: dan-chat-advanced
287
+ - path: PocketDoc/Dans-Prosemaxx-Adventure
288
+ type: dan-chat-advanced
289
+ - path: PocketDoc/Dans-Failuremaxx-Adventure-3
290
+ type: dan-chat-advanced
291
+ - path: PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-2
292
+ type: dan-chat-advanced
293
+ - path: PocketDoc/Dans-Prosemaxx-InstructWriter-ZeroShot-3
294
+ type: dan-chat-advanced
295
+ - path: PocketDoc/Dans-Prosemaxx-InstructWriter-Continue-2
296
+ type: dan-chat-advanced
297
+ - path: PocketDoc/Dans-Prosemaxx-Instructwriter-Long
298
+ type: dan-chat-advanced
299
+ - path: PocketDoc/Dans-Personamaxx-VN
300
+ type: dan-chat-advanced
301
+ shuffle_merged_datasets: true
302
+ dataset_prepared_path: prepared_data
303
+ val_set_size: 0.01
304
+ output_dir: ./adventure-2
305
+
306
+ sequence_len: 32768
307
+ sample_packing: true
308
+ pad_to_sequence_len: true
309
+ eval_sample_packing: False
310
 
311
+ adapter: lora
312
+ lora_model_dir:
313
+ lora_r: 128
314
+ lora_alpha: 16
315
+ lora_dropout: 0.05
316
+ peft_use_rslora: true
317
+ lora_target_modules:
318
+ - gate_proj
319
+ - down_proj
320
+ - up_proj
321
+ - q_proj
322
+ - v_proj
323
+ - k_proj
324
+ - o_proj
325
+
326
+
327
+ wandb_project: tavbussy
328
+ wandb_entity:
329
+ wandb_watch:
330
+ wandb_name: adventure-attempt-v2
331
+ wandb_log_model:
332
+
333
+ gradient_accumulation_steps: 2
334
+ micro_batch_size: 4
335
+ num_epochs: 6
336
+ optimizer: adamw_bnb_8bit
337
+ lr_scheduler: cosine
338
+ learning_rate: 3e-5
339
+ max_grad_norm: 0.2
340
+
341
+ train_on_inputs: false
342
+ group_by_length: false
343
+ bf16: auto
344
+ fp16:
345
+ tf32: false
346
+
347
+ gradient_checkpointing: true
348
+ early_stopping_patience:
349
+ resume_from_checkpoint:
350
+ local_rank:
351
+ logging_steps: 1
352
+ xformers_attention:
353
+ flash_attention: true
354
+
355
+ warmup_steps: 35
356
+ evals_per_epoch: 4
357
+ eval_table_size:
358
+ eval_max_new_tokens: 128
359
+ saves_per_epoch: 2
360
+ debug:
361
+ deepspeed: ./deepspeed_configs/zero3_bf16.json
362
+ weight_decay: 0.03
363
+ fsdp:
364
+ fsdp_config:
365
+ special_tokens:
366
+ pad_token: <|finetune_right_pad_id|>
367
  ```
368
+
369
+ </details>
370
+ </div>
371
+
372
+ <div align="center">
373
+
374
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
375
+
376
+ ## ⚡ Credits
377
+ <div style="display: flex; justify-content: center;">
378
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
379
+
380
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
381
+ <a href="https://huggingface.co/lucyknada">
382
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
383
+ </a>
384
+ </div>
385
+
386
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
387
+ <a href="https://huggingface.co/hamanasu">
388
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="Ruka">
389
+ </a>
390
+ </div>
391
+
392
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
393
+ <a href="https://huggingface.co/intervitens">
394
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
395
+ </a>
396
+ </div>
397
+
398
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
399
+ <a href="https://huggingface.co/kalomaze">
400
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
401
+ </a>
402
+ </div>
403
+
404
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
405
+ <a href="https://huggingface.co/kubernetes-bad">
406
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
407
+ </a>
408
+ </div>
409
+
410
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
411
+ <a href="https://huggingface.co/anthracite-org">
412
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
413
+ </a>
414
+ </div>
415
+ </div>
416
+ </div>
417
+ </div>
418
+
419
+ ---
420
+
421
+ <div align="center">
422
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
423
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
424
+ </div>
425
+
426
+ </div>
427
+ </body>
428
+ </html>