language: | |
- en | |
base_model: | |
- Qwen/Qwen2.5-0.5B | |
tags: | |
- reasoning | |
- o1 | |
- thinker | |
A Qwen finetune designed to mimic the reasoning of OpenAI's o1. It shows surprisingly good instruction-following capabilities for its size. | |
Use this system prompt: | |
``` | |
You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step. | |
``` |