stepfun-ai/Qwen2.5-32B-DialogueReason


buyun committed on May 12

Commit e40c27b · verified · 1 Parent(s): ad128b9

Update README.md

Files changed (1)

README.md +10 -3

@@ -1,3 +1,10 @@
----
-license: apache-2.0
----

+## Introduction
+Qwen2.5-32B-DialogueReason is a dialogue-based reasoning model built on Qwen2.5-32B-Base.
+We train the model using [Open-Reasoner-Zero](https://github.com/Open-Reasoner-Zero/Open-Reasoner-Zero) data through rule-based reinforcement learning.
+## 🧠 Key Features
+- Uses Qwen2.5-32B-Base as the foundation.
+- Applies rule-based RL to achieve dialogue reasoning.
+- Supports dynamic agent initialization to adapt to various scenarios.
+- Supports flexible environment configuration to set up task-specific contexts.
+- Uses multi-turn dialogue reasoning to solve problems incrementally.
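
The README names rule-based reinforcement learning but does not spell out the reward rule. As a rough illustration only, a rule-based reward can be as simple as comparing an extracted final answer against a reference. The sketch below assumes the answer is wrapped in `\boxed{...}` and scored by exact string match; both the format and the helper names (`extract_answer`, `rule_based_reward`) are hypothetical and not taken from this repository or from Open-Reasoner-Zero.

```python
import re
from typing import Optional

# Assumption: the final answer appears as \boxed{...} with no nested braces.
BOXED = re.compile(r"\\boxed\{([^{}]*)\}")


def extract_answer(response: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in the response, if any."""
    matches = BOXED.findall(response)
    return matches[-1].strip() if matches else None


def rule_based_reward(response: str, reference: str) -> float:
    """Hypothetical binary reward: 1.0 on exact match, otherwise 0.0."""
    answer = extract_answer(response)
    if answer is None:
        return 0.0
    return 1.0 if answer == reference.strip() else 0.0
```

A reward of this kind needs no learned reward model: a response either satisfies the rule or it does not, which keeps the training signal cheap to compute and hard to game through style alone.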