File size: 585 Bytes
e40c27b
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
## Introduction
Qwen2.5-32B-DialogueReason is a dialogue-based reasoning model built on Qwen2.5-32B-Base.  
We train the model using [Open-Reasoner-Zero](https://github.com/Open-Reasoner-Zero/Open-Reasoner-Zero) data through rule-based reinforcement learning.  

## 🧠 Key Features
- Qwen2.5-32B-Base as the foundation.
- Use Rule-Based RL to achieve dialogue reasoning.
- With dynamic agent initialization to adapt to various scenarios.
- With flexible environment configuration to set up task-specific contexts.
- With multi-turn dialogue reasoning to incrementally solve problems.