DeepSeek-R1-Distill-Qwen-1.5B-medical models
Collection
Collection of merged models of DeepSeek-R1-Distill-Qwen-1.5B fine-tuned for adapting to medical domain in MedAdapt-LLM project.
•
6 items
•
Updated
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.