Qwen3-8B-TIR-DAPO / README.md
Heng1999's picture
Improve model card: Add pipeline tag, library name, and paper metadata (#1)
68d86ed verified
metadata
base_model:
  - Qwen/Qwen3-8B
datasets:
  - Heng1999/dapo-en-10k
  - Heng1999/Omni-MATH-512
language:
  - en
license: apache-2.0
tags:
  - tool-use
  - reasoning
  - mathematics
  - qwen
pipeline_tag: text-generation
library_name: transformers
paper: 2508.19201

Understanding Tool-Integrated Reasoning (TIR Model)

This repository contains the official DAPO-trained Tool-Integrated Reasoning (TIR) model from the paper: Understanding Tool-Integrated Reasoning.

This model is a Qwen3-8B fine-tuned using the DAPO algorithm to effectively solve complex mathematical problems by integrating a Python code interpreter. It serves as the primary experimental model in our paper.