--- license: mit language: - en base_model: - Qwen/Qwen3-30B-A3B --- > [!NOTE] > **Use "You are an assistant with reasoning capabilities." system message to consistently trigger gemini-style thinking.** > [!NOTE] > **I'm working on improving the dataset & model and will release a new, full version.** # Training Dataset - The fine-tuning dataset consists of ~450 diverse examples, 250 of which are directly from Gemini 2.5 Pro. ## Trained on: - Unsloth version of Qwen3-30B-A3B (instruct). - 32k seq_len with examples ranging from 1k to ~20k tokens. - Up to 2 turns of conversations. --- - No benchmark data for now. **Keep in mind that it's slightly overfit since the training dataset was quite small. The model can be used to create more high quality examples for further training.** ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/TEBe1XQvpJA2IZ63btFWT.png)