Fine-tuning qwen3-4B for paper search query rewriting using PaSa dataset: https://huggingface.co/datasets/CarlanLark/pasa-dataset

Setting enable_thinking = false for chat_template in file tokenizer_config.json when training: {%- set enable_thinking = enable_thinking if enable_thinking is defined else false %}

Downloads last month
6
Safetensors
Model size
4.02B params
Tensor type
FP16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Kita0421/qwen3-pasa-sft

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(101)
this model

Dataset used to train Kita0421/qwen3-pasa-sft