YorkieOH10 commited on
Commit
f5fea3c
·
verified ·
1 Parent(s): a79372e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ 🧪 Gemma-2B-DolphinR1-TestV2 (Experimental Fine-Tune) 🧪
2
+ This is an experimental fine-tune of Google's Gemma-2B using the [Dolphin-R1 dataset](https://huggingface.co/datasets/cognitivecomputations/dolphin-r1). The goal is to enhance reasoning and chain-of-thought capabilities while maintaining efficiency with LoRA (r=32) and 4-bit quantization.
3
+
4
+ 🚨 Disclaimer: This model is very much a work in progress and is still being tested for performance, reliability, and generalization. Expect quirks, inconsistencies, and potential overfitting in responses.
5
+
6
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6339a8648f27255b6b51180c/gMsWZ5sRzDiftZFZ0tFxA.png)
7
+
8
+
9
+ This is made possible thanks to @unsloth. I am still very new at finetuning Large Language Models so this is more of a showcase of my learning journey. Remember, it's very experimental, do not recommend downloading or testing.