Tina - Ablation Studies
Tina (Tiny Reasoning Models via LoRA) models are all fine-tuned adapters on the base model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B. The LoRA adapter in this repo is fine-tuned on the dataset knoveleng/open-rs. Please refer to our paper Tina: Tiny Reasoning Models via LoRA for more training details.
The Tina model is meant to be used in combination with the base model as a standard adapter. In particular, we release all checkpoints we have for each Tina model, and one can select a specific checkpoint by specifying the subfolder argument.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the base model that the Tina LoRA adapter was trained on
base_model = AutoModelForCausalLM.from_pretrained(
    "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(
    "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
)

# Attach the LoRA adapter; select a checkpoint via `subfolder`
model = PeftModel.from_pretrained(
    base_model,
    "Tina-Yi/Tina-Open-RS3-format-only",
    subfolder="checkpoint-850"  # checkpoint-850 is the best-performing checkpoint
)
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B