--- library_name: transformers pipeline_tag: text-generation base_model: - Qwen/Qwen2.5-3B --- ## UFT This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984). Code: https://github.com/liumy2010/UFT ## References * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)