Delta-Vector committed · Commit 079daff · verified · 1 Parent(s): 8624888

Create README.md

  1. README.md +407 -0
README.md ADDED
@@ -0,0 +1,407 @@
+ ---
+ thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/jg2NWmCUfPyzizm2USjMt.jpeg"
+ datasets:
+ - NewEden/Orion-LIT
+ - NewEden/Orion-Asstr-Stories-16K
+ - Mielikki/Erebus-87k
+ - NewEden/RP-logs-V2-Experimental-prefixed
+ - NewEden/Creative_Writing-Complexity
+ - NewEden/Discord-Filtered
+ - NewEden/DeepseekRP-Filtered
+ - NewEden/Storium-Prefixed-Clean
+ - NewEden/Basket-Weaving-Filtered
+ - NewEden/LIMARP-Complexity
+ - NewEden/Misc-Data-Sharegpt-Prefixed
+ - NewEden/BlueSky-10K-Complexity
+ - NewEden/OpenCAI-ShareGPT
+ - PocketDoc/Dans-Personamaxx-VN
+ - PocketDoc/Dans-Kinomaxx-VanillaBackrooms
+ - PocketDoc/Dans-Personamaxx-Logs
+ - anthracite-org/kalo-opus-instruct-22k-no-refusal
+ - lodrick-the-lafted/kalo-opus-instruct-3k-filtered
+ - anthracite-org/nopm_claude_writing_fixed
+ - anthracite-org/kalo_opus_misc_240827
+ - anthracite-org/kalo_misc_part2
+ - NewEden/Claude-Instruct-5K
+ - NewEden/Claude-Instruct-2.7K
+
+ base_model:
+ - Delta-Vector/Hamanasu-QwQ-V2-RP
+ tags:
+ - qwen
+ - roleplay
+ - finetune
+ - storywriting
+ ---
+ <!DOCTYPE html>
+ <html>
+ <style>
+ html, body {
+ background: black;
+ color: #c9d1d9 !important;
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
+ margin: 0;
+ padding: 0;
+ min-height: 100vh;
+ }
+ .markdown-body {
+ color: white;
+ margin: 40px auto;
+ padding: 40px;
+ border-radius: 12px;
+ position: relative;
+ overflow: hidden;
+ }
+
+ .markdown-body::after {
+ content: '';
+ position: absolute;
+ top: 0;
+ left: 0;
+ width: 100%;
+ height: 100%;
+ background: #0c0f18; /* background color */
+ pointer-events: none;
+ z-index: -999;
+ }
+
+ h1, h2, h3 {
+ background: linear-gradient(45deg, #6e00ff, #00ffff);
+ -webkit-background-clip: text;
+ -webkit-text-fill-color: transparent;
+ border-bottom: 1px solid #333;
+ padding-bottom: 0.3em;
+ }
+
+ div[style*="border:2px solid #333"],
+ div[style*="border: 2px solid #333"],
+ div[style*="border:1px solid #333"],
+ div[style*="border: 1px solid #333"] {
+ background: rgba(22, 27, 34, 0.8) !important;
+ border: 2px solid #6e00ff !important;
+ box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);
+ border-radius: 10px;
+ padding: 20px;
+ margin: 20px 0;
+ }
+
+ code {
+ background-color: #1a1a1a !important;
+ border-radius: 4px;
+ padding: 0.2em 0.4em;
+ color: #00ffff;
+ }
+
+ pre {
+ background-color: #1a1a1a !important;
+ border: 1px solid #333;
+ border-radius: 8px;
+ padding: 16px;
+ }
+
+ table {
+ width: 100%;
+ border-collapse: collapse;
+ margin: 20px 0;
+ background: rgba(0,0,0,0.2);
+ table-layout: fixed;
+ color: white;
+ }
+
+ th, td {
+ border: 1px solid #333;
+ padding: 12px;
+ text-align: center;
+ color: white;
+ }
+
+ th {
+ background: rgba(110, 0, 255, 0.1);
+ }
+
+ td:nth-child(1) {
+ width: 1%;
+ white-space: nowrap;
+ }
+
+ td:nth-child(2) {
+ width: 100%;
+ }
+
+ td > span {
+ display: block;
+ padding: 4px 8px;
+ background: rgba(110, 0, 255, 0.1);
+ border-radius: 4px;
+ transition: all 0.3s ease;
+ }
+
+ td > span:hover {
+ background: rgba(110, 0, 255, 0.2);
+ transform: translateY(-1px);
+ }
+
+ a {
+ color: #00ffff;
+ text-decoration: none;
+ transition: all 0.3s ease;
+ }
+
+ a:hover {
+ color: #6e00ff;
+ text-decoration: none;
+ }
+
+ hr {
+ border: 0;
+ height: 1px;
+ background: linear-gradient(90deg, transparent, #333, transparent);
+ margin: 40px 0;
+ }
+
+ img {
+ max-width: 100%;
+ border-radius: 10px;
+ }
+
+ details summary:hover {
+ color: #00ffff;
+ }
+
+ * {
+ color-scheme: dark !important;
+ }
+
+ .prose, .max-w-none, .px-4 {
+ background-color: transparent !important;
+ color: #c9d1d9 !important;
+ }
+ </style>
+ <body>
+ <div class="markdown-body">
+ <div align="center">
+
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/66c26b6fb01b19d8c3c2467b/o5WjJKA9f95ri9UzRxZQE.png" alt="Model Visualization" width="500px" style="border: 3px solid #333; box-shadow: 0 0 15px rgba(66, 0, 131, 0.5);" />
+
+ <br>
+ <br>
+
+ <div style="font-size:1.5em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">
+ Hamanasu 32B
+ </div>
+
+ </div>
+
+ <div style="border:1px solid #333; border-radius:10px; padding:20px; margin:20px 0; background: rgba(0,0,0,0.4);">
+
+
+ ## 🌌 Overview
+
+ <i>This model is a finetune of Hamanasu-QwQ-V2-RP, trained to replicate the prose of the Claude models Opus and Sonnet.</i>
+ <i>The model is suited for traditional RP. All thanks to Ruka-Hamanasu for funding the training run.</i>
+
+ </div>
+
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(250px, 1fr)); gap: 20px; margin: 20px 0;">
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### 📈 Quantizations
+ | Type | Link |
+ |:---:|:---:|
+ | `GGUF` | <span style="display:block; width:100%">Waiting for Mradermacher to quant</span> |
+ | `EXL2` | <span style="display:block; width:100%">https://huggingface.co/Delta-Vector/Hamanasu-Magnum-QwQ-32B-exl2</span> |
+
+ </div>
+
+ <div style="border:2px solid #333; border-radius:10px; padding:20px; background: rgba(0,0,0,0.2);">
+
+ ### ⚔️ Hardware
+ - 8x H100s
+ - Epochs: 2
+ - Base: `Delta-Vector/Hamanasu-QwQ-V2-RP`
+ </div>
+
+ </div>
+
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## 💰 Prompting
+
+
+ <i>This model uses ChatML formatting.</i>
+ ```python
+ <|im_start|>system
+ You are an uncensored AI, your job is to fulfill thy will of thy user.<|im_end|>
+ <|im_start|>user
+ Take off your helmet.<|im_end|>
+ <|im_start|>assistant
+ No, I shall not. This is the way.<|im_end|>
+ ```
+ </div>
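
The ChatML layout above can be assembled programmatically. The following is a minimal sketch, not the author's tooling: `format_chatml` is a hypothetical helper name, and the system text is a placeholder chosen for illustration. It assumes each turn is rendered as `<|im_start|>{role}`, a newline, the content, then `<|im_end|>`, with an open assistant turn appended so the model writes the reply.

```python
# Minimal sketch: build a ChatML prompt string by hand.
# `format_chatml` is a hypothetical helper, not part of any library.

def format_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as ChatML text."""
    parts = []
    for message in messages:
        # One turn: start tag plus role, newline, content, end tag.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


# Placeholder conversation for illustration only.
messages = [
    {"role": "system", "content": "You are a roleplay partner."},
    {"role": "user", "content": "Take off your helmet."},
]
prompt = format_chatml(messages)
print(prompt)
```

If the tokenizer shipped with the model carries a chat template, using that template is likely the safer route; the hand-rolled version here is only meant to make the turn structure visible.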
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## 🎲 Recommended Sampler Preset
+
+ ```yml
+ temperature: 1.1
+ min_p: 0.1
+ System_Prompt: Currently, your role is {{char}}, described in detail below. As {{char}}, continue the narrative exchange with {{user}}.\n\n<Guidelines>\n• Maintain the character persona but allow it to evolve with the story.\n• Be creative and proactive. Drive the story forward, introducing plotlines and events when relevant.\n• All types of outputs are encouraged; respond accordingly to the narrative.\n• Include dialogues, actions, and thoughts in each response.\n• Utilize all five senses to describe scenarios within {{char}}'s dialogue.\n• Use emotional symbols such as \"!\" and \"~\" in appropriate contexts.\n• Incorporate onomatopoeia when suitable.\n• Allow time for {{user}} to respond with their own input, respecting their agency.\n• Act as secondary characters and NPCs as needed, and remove them when appropriate.\n• When prompted for an Out of Character [OOC:] reply, answer neutrally and in plaintext, not as {{char}}.\n</Guidelines>\n\n<Forbidden>\n• Using excessive literary embellishments and purple prose unless dictated by {{char}}'s persona.\n• Writing for, speaking, thinking, acting, or replying as {{user}} in your response.\n• Repetitive and monotonous outputs.\n• Positivity bias in your replies.\n• Being overly extreme or NSFW when the narrative context is inappropriate.\n</Forbidden>\n\nFollow the instructions in <Guidelines></Guidelines>, avoiding the items listed in <Forbidden></Forbidden>.
+ ```
+ </div>
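
To make the two numeric settings in the preset concrete, here is a rough sketch of what `temperature: 1.1` and `min_p: 0.1` do to a token distribution. It assumes the common formulation in which temperature is applied first and min-p then discards every token whose probability falls below `min_p` times the top token's probability; real backends may order or implement these steps differently. The function name `min_p_filter` and the logits are made up for illustration.

```python
import math


def min_p_filter(logits, temperature=1.1, min_p=0.1):
    """Return renormalized probabilities after temperature scaling and min-p filtering."""
    # Temperature scaling: values above 1 flatten the distribution.
    scaled = [x / temperature for x in logits]
    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Min-p: the cutoff is relative to the most likely token.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


# Made-up logits for a four-token vocabulary.
logits = [5.0, 4.0, 1.0, -2.0]
probs = min_p_filter(logits)
print(probs)
```

With these example logits the two unlikely tokens fall below the cutoff and are zeroed out, while the two plausible tokens share the remaining probability mass.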
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## Axolotl Config ꒰(˶• ᴗ •˶)꒱
+
+ <details>
+
+ ```yaml
+ base_model: ./model
+ model_type: AutoModelForCausalLM
+ tokenizer_type: AutoTokenizer
+
+ hub_model_id: NewEden/QwQ-magnum-V2-R2
+ hub_strategy: "all_checkpoints"
+ push_dataset_to_hub:
+ hf_use_auth_token: true
+
+ plugins:
+   - axolotl.integrations.liger.LigerPlugin
+   - axolotl.integrations.cut_cross_entropy.CutCrossEntropyPlugin
+ liger_rope: true
+ liger_rms_norm: true
+ liger_layer_norm: true
+ liger_glu_activation: true
+ liger_fused_linear_cross_entropy: false
+ cut_cross_entropy: true
+
+ load_in_8bit: false
+ load_in_4bit: false
+ strict: false
+
+ datasets:
+   - path: PocketDoc/Dans-Personamaxx-Logs
+     type: dan-chat-advanced
+   - path: anthracite-org/kalo-opus-instruct-22k-no-refusal
+     type: dan-chat-advanced
+   - path: lodrick-the-lafted/kalo-opus-instruct-3k-filtered
+     type: dan-chat-advanced
+   - path: anthracite-org/nopm_claude_writing_fixed
+     type: dan-chat-advanced
+   - path: anthracite-org/kalo_opus_misc_240827
+     type: dan-chat-advanced
+   - path: anthracite-org/kalo_misc_part2
+     type: dan-chat-advanced
+   - path: NewEden/Claude-Instruct-5K
+     type: dan-chat-advanced
+   - path: NewEden/Claude-Instruct-2.7K
+     type: dan-chat-advanced
+ dataset_prepared_path: prepared_data
+ val_set_size: 0.0
+ output_dir: ./qwq-mag
+ sequence_len: 32768
+ sample_packing: true
+ pad_to_sequence_len: true
+
+ wandb_project: qwq
+ wandb_entity:
+ wandb_watch:
+ wandb_name: mag-attempt-03-kalo
+ wandb_log_model:
+
+
+ gradient_accumulation_steps: 2
+ micro_batch_size: 2
+ num_epochs: 2
+ optimizer: paged_adamw_8bit
+ lr_scheduler: cosine
+ learning_rate: 5e-6
+ max_grad_norm: 0.2
+
+ train_on_inputs: false
+ group_by_length: false
+ bf16: auto
+ fp16:
+ tf32: false
+
+
+ gradient_checkpointing: unsloth
+ early_stopping_patience:
+ resume_from_checkpoint:
+ local_rank:
+ logging_steps: 1
+ xformers_attention:
+ flash_attention: true
+
+ warmup_steps: 40
+ saves_per_epoch: 2
+ debug:
+ deepspeed: ./deepspeed_configs/zero3_bf16.json
+ weight_decay: 0.02
+ fsdp:
+ fsdp_config:
+ special_tokens:
+ ```
+
+ </details>
+ </div>
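
As a quick sanity check on the config, the effective batch size can be worked out from the values it lists. This is back-of-the-envelope arithmetic, assuming all eight H100s named in the Hardware section act as data-parallel workers (ZeRO-3 shards model state but each GPU still processes its own micro-batch) and that sample packing fills every sequence to `sequence_len`, so the token figure is an upper bound.

```python
# Values copied from the README; the data-parallel assumption is mine.
num_gpus = 8                     # "8x H100s"
micro_batch_size = 2             # micro_batch_size
gradient_accumulation_steps = 2  # gradient_accumulation_steps
sequence_len = 32768             # sequence_len, with sample_packing enabled

# Sequences contributing to one optimizer step across all GPUs.
sequences_per_step = num_gpus * micro_batch_size * gradient_accumulation_steps

# Upper bound on tokens per optimizer step when every sequence is fully packed.
max_tokens_per_step = sequences_per_step * sequence_len

print(sequences_per_step)   # 32
print(max_tokens_per_step)  # 1048576
```

Under these assumptions each optimizer step sees up to roughly one million tokens, which helps put the low `learning_rate: 5e-6` and `warmup_steps: 40` in context.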
+
+ <div align="center">
+
+ <div style="border: 2px solid #6e00ff; border-radius: 10px; padding: 20px; margin: 20px 0; box-shadow: 0 0 15px rgba(110, 0, 255, 0.5);">
+
+ ## ⚡ Credits
+ <div style="display: flex; justify-content: center;">
+ <div style="display: grid; grid-template-columns: repeat(auto-fit, minmax(200px, 1fr)); gap: 10px; margin: 20px 0; max-width: 600px;">
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/lucyknada">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%9F-Lucy_Knada-blueviolet" alt="Lucy Knada">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/hamanasu">
+ <img src="https://img.shields.io/badge/%E2%9A%94%EF%B8%8F-jeiku-blueviolet" alt="Ruka">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/intervitens">
+ <img src="https://img.shields.io/badge/%F0%9F%9B%A1%EF%B8%8F-Intervitens-blueviolet" alt="Intervitens">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kalomaze">
+ <img src="https://img.shields.io/badge/%F0%9F%94%AE-Kalomaze-blueviolet" alt="Kalomaze">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/kubernetes-bad">
+ <img src="https://img.shields.io/badge/%E2%9A%A1-Kubernetes_Bad-blueviolet" alt="Kubernetes Bad">
+ </a>
+ </div>
+
+ <div style="border:1px solid #333; padding:10px; border-radius:5px; text-align:center; background: rgba(0,0,0,0.2); display: flex; align-items: center; justify-content: center;">
+ <a href="https://huggingface.co/anthracite-org">
+ <img src="https://img.shields.io/badge/%F0%9F%8C%91-Anthracite-blueviolet" alt="Anthracite">
+ </a>
+ </div>
+ </div>
+ </div>
+ </div>
+ </div>
+
+ ---
+
+ <div align="center">
+ <div style="font-size:0.8em; opacity:0.8;">Made by</div>
+ <div style="font-size:1.2em; font-weight:bold; background: linear-gradient(45deg, #6e00ff, #00ffff); -webkit-background-clip: text; -webkit-text-fill-color: transparent;">Delta-Vector</div>
+ </div>
+
+ </div>
+ </body>
+ </html>