Qwen3-8B-TIR-DAPO / README.md
Heng1999's picture
Improve model card: Add pipeline tag, library name, and paper metadata (#1)
68d86ed verified
---
base_model:
- Qwen/Qwen3-8B
datasets:
- Heng1999/dapo-en-10k
- Heng1999/Omni-MATH-512
language:
- en
license: apache-2.0
tags:
- tool-use
- reasoning
- mathematics
- qwen
pipeline_tag: text-generation
library_name: transformers
paper: 2508.19201
---
# Understanding Tool-Integrated Reasoning (TIR Model)
This repository contains the official DAPO-trained Tool-Integrated Reasoning (TIR) model from the paper: **[Understanding Tool-Integrated Reasoning](https://arxiv.org/abs/2508.19201)**.
This model is a Qwen3-8B fine-tuned using the DAPO algorithm to effectively solve complex mathematical problems by integrating a Python code interpreter. It serves as the primary experimental model in our paper.