Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,9 @@
|
|
1 |
🧪 Gemma-2B-DolphinR1-TestV2 (Experimental Fine-Tune) 🧪
|
2 |
-
|
|
|
|
|
|
|
|
|
3 |
|
4 |
🚨 Disclaimer: This model is very much a work in progress and is still being tested for performance, reliability, and generalization. Expect quirks, inconsistencies, and potential overfitting in responses.
|
5 |
|
|
|
1 |
🧪 Gemma-2B-DolphinR1-TestV2 (Experimental Fine-Tune) 🧪
|
2 |
+
|
3 |
+
|
4 |
+
This is an experimental fine-tune of Google's Gemma-2B using the [Dolphin-R1 dataset](https://huggingface.co/datasets/cognitivecomputations/dolphin-r1).
|
5 |
+
|
6 |
+
The goal is to enhance reasoning and chain-of-thought capabilities while maintaining efficiency with LoRA (r=32) and 4-bit quantization.
|
7 |
|
8 |
🚨 Disclaimer: This model is very much a work in progress and is still being tested for performance, reliability, and generalization. Expect quirks, inconsistencies, and potential overfitting in responses.
|
9 |
|