metadata
base_model:
- Qwen/Qwen3-8B
datasets:
- Heng1999/dapo-en-10k
- Heng1999/Omni-MATH-512
language:
- en
license: apache-2.0
tags:
- tool-use
- reasoning
- mathematics
- qwen
pipeline_tag: text-generation
library_name: transformers
paper: 2508.19201
Understanding Tool-Integrated Reasoning (TIR Model)
This repository contains the official DAPO-trained Tool-Integrated Reasoning (TIR) model from the paper: Understanding Tool-Integrated Reasoning.
This model is a Qwen3-8B fine-tuned using the DAPO algorithm to effectively solve complex mathematical problems by integrating a Python code interpreter. It serves as the primary experimental model in our paper.