Arcana Qwen3 Abliterated
Collection
Abliterated components of https://huggingface.co/suayptalha/Arcana-Qwen3-2.4B-A0.6B
•
4 items
•
Updated
This project performs full fine-tuning on the Qwen3-0.6B language model to enhance its medical reasoning and clinical understanding capabilities. Training was conducted on the FreedomIntelligence/medical-o1-reasoning-SFT
dataset using bfloat16 (bf16) precision for efficient optimization.
Additionally, it has been abliterated to make it steer away from censorship.
Dataset Preparation
FreedomIntelligence/medical-o1-reasoning-SFT
dataset was used.Model Loading and Configuration
unsloth
library in bf16 precision.full_finetuning=True
) to effectively adapt the model to medical reasoning and decision-making tasks.Supervised Fine-Tuning
This project is licensed under the Apache License 2.0. See the LICENSE file for details.
Base model
Qwen/Qwen3-0.6B-Base