buyun committed on
Commit e40c27b · verified · 1 Parent(s): ad128b9

Update README.md

Files changed (1)
  1. README.md +10 -3
README.md CHANGED
@@ -1,3 +1,10 @@
- ---
- license: apache-2.0
- ---
+ ## Introduction
+ Qwen2.5-32B-DialogueReason is a dialogue-based reasoning model built on Qwen2.5-32B-Base.
+ We train the model using [Open-Reasoner-Zero](https://github.com/Open-Reasoner-Zero/Open-Reasoner-Zero) data through rule-based reinforcement learning.
+
+ ## 🧠 Key Features
+ - Qwen2.5-32B-Base as the foundation.
+ - Use Rule-Based RL to achieve dialogue reasoning.
+ - With dynamic agent initialization to adapt to various scenarios.
+ - With flexible environment configuration to set up task-specific contexts.
+ - With multi-turn dialogue reasoning to incrementally solve problems.
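
The README added in this commit mentions rule-based reinforcement learning but does not describe the reward. As an illustration only, a rule-based reward is often a deterministic check of the model's final answer against a reference. The sketch below assumes answers are wrapped in `\boxed{...}` and scored by exact string match; the function names `extract_final_answer` and `rule_based_reward` are hypothetical and are not taken from this repository or from Open-Reasoner-Zero.

```python
import re
from typing import Optional


def extract_final_answer(response: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} span, or None if absent.

    Assumption: the model marks its final answer with \\boxed{...} and the
    answer itself contains no nested braces.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", response)
    return matches[-1].strip() if matches else None


def rule_based_reward(response: str, reference: str) -> float:
    """Score a response with a fixed rule instead of a learned reward model.

    Returns 1.0 when the extracted answer equals the reference exactly,
    and 0.0 otherwise (including when no answer can be extracted).
    """
    answer = extract_final_answer(response)
    if answer is None:
        return 0.0
    return 1.0 if answer == reference.strip() else 0.0
```

For example, `rule_based_reward("A: 6 * 7 is \\boxed{42}", "42")` scores the response as correct, while a response with no boxed answer receives zero reward. Real pipelines usually add answer normalization (for instance, treating `0.5` and `1/2` as equal), which this sketch omits.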