Darkhn
/

Qwen-2.5-Ink-v0.08-6.0bpw-h8-exl2

Safetensors

qwen2

6-bit

exl2

Model card Files Files and versions Community

Improve language tag

by lbourdois - opened Apr 27

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+81

-67

Files changed (1) hide show

README.md +81 -67

README.md CHANGED Viewed

@@ -1,68 +1,82 @@
----
-base_model:
-- Qwen/Qwen2.5-72B-Instruct
-tags:
-- conversational
-- roleplay
-- chat
-license: other
-license_name: qwen
----
-# Qwen 2.5 72b RP Ink
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/M9KSL64gppBVatmTdoQnG.png)
-A roleplay-focused LoRA finetune of Qwen 2.5 72b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
-Yet another model in the Ink series, following in the footsteps of [the 32b one](https://huggingface.co/allura-org/Qwen2.5-32b-RP-Ink) and [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
-## Testimonials
-> [Compared to the 32b] felt a noticeable increase in coherence
-\- ShotMisser64
-> Yeah ep2's great!! made me actually wanna write a reply by myself for the first time in a few days
-\- Maw
-> This is the best RP I've ever had
-\- 59smoke
-> this makes me want to get another 3090 to run 72b
-\- dysfunctional
-## Dataset
-The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
-"this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
-Update: I have sent the (public datasets in the) data mix publicly already so here's that
-<details>
-<img src=https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/JtjUoKtbOfBZfSSKojTcj.png>
-</details>
-## Quants
-[imatrix GGUFs by bartowski](https://huggingface.co/bartowski/Qwen2.5-72b-RP-Ink-GGUF)
-## Recommended Settings
-Chat template: ChatML
-Recommended samplers (not the be-all-end-all, try some on your own!):
-- Temp 0.83 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
-- Your samplers can go here! :3
-## Hyperparams
-### General
-- Epochs = 2
-- LR = 6e-5
-- LR Scheduler = Cosine
-- Optimizer = Paged AdamW 8bit
-- Effective batch size = 16
-### LoRA
-- Rank = 16
-- Alpha = 32
-- Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
-## Credits
-Humongous thanks to the people who created and curated the original data
-Big thanks to all Allura members, for testing and emotional support ilya /platonic
-especially to inflatebot who made the model card's image :3
 Another big thanks to all the members of the ArliAI and BeaverAI Discord servers for testing! All of the people featured in the testimonials are from there :3

+---
+base_model:
+- Qwen/Qwen2.5-72B-Instruct
+tags:
+- conversational
+- roleplay
+- chat
+license: other
+license_name: qwen
+language:
+- zho
+- eng
+- fra
+- spa
+- por
+- deu
+- ita
+- rus
+- jpn
+- kor
+- vie
+- tha
+- ara
+---
+# Qwen 2.5 72b RP Ink
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/M9KSL64gppBVatmTdoQnG.png)
+A roleplay-focused LoRA finetune of Qwen 2.5 72b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
+Yet another model in the Ink series, following in the footsteps of [the 32b one](https://huggingface.co/allura-org/Qwen2.5-32b-RP-Ink) and [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
+## Testimonials
+> [Compared to the 32b] felt a noticeable increase in coherence
+\- ShotMisser64
+> Yeah ep2's great!! made me actually wanna write a reply by myself for the first time in a few days
+\- Maw
+> This is the best RP I've ever had
+\- 59smoke
+> this makes me want to get another 3090 to run 72b
+\- dysfunctional
+## Dataset
+The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
+"this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
+Update: I have sent the (public datasets in the) data mix publicly already so here's that
+<details>
+<img src=https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/JtjUoKtbOfBZfSSKojTcj.png>
+</details>
+## Quants
+[imatrix GGUFs by bartowski](https://huggingface.co/bartowski/Qwen2.5-72b-RP-Ink-GGUF)
+## Recommended Settings
+Chat template: ChatML
+Recommended samplers (not the be-all-end-all, try some on your own!):
+- Temp 0.83 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
+- Your samplers can go here! :3
+## Hyperparams
+### General
+- Epochs = 2
+- LR = 6e-5
+- LR Scheduler = Cosine
+- Optimizer = Paged AdamW 8bit
+- Effective batch size = 16
+### LoRA
+- Rank = 16
+- Alpha = 32
+- Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
+## Credits
+Humongous thanks to the people who created and curated the original data
+Big thanks to all Allura members, for testing and emotional support ilya /platonic
+especially to inflatebot who made the model card's image :3
 Another big thanks to all the members of the ArliAI and BeaverAI Discord servers for testing! All of the people featured in the testimonials are from there :3