Ba2han commited on
Commit
24edf4b
·
verified ·
1 Parent(s): dee035e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +29 -0
README.md ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ base_model:
6
+ - Qwen/Qwen3-30B-A3B
7
+ ---
8
+
9
+
10
+ > [!NOTE]
11
+ > **Use "You are an assistant with reasoning capabilities." system message to trigger gemini-style thinking.**
12
+
13
+ # Training Dataset
14
+
15
+ - The fine-tuning dataset consists of ~450 diverse examples, 250 of which are directly from Gemini 2.5 Pro.
16
+
17
+ ## Trained on:
18
+ - Unsloth version of Qwen3-30B-A3B (instruct).
19
+ - 32k seq_len with examples ranging from 1k to ~20k tokens.
20
+ - Up to 2 turns of conversations.
21
+
22
+ ---
23
+
24
+ - No benchmark data for now.
25
+
26
+ **Keep in mind that it's slightly overfit since the training dataset was quite small. The model can be used to create more high quality examples for further training.**
27
+
28
+
29
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/TEBe1XQvpJA2IZ63btFWT.png)