FM-1976/gemma-2b-docjoybot-lora-F16-GGUF
This LoRA adapter was converted to GGUF format from Joy10/gemma-2b-docjoybot-lora
via the ggml.ai's GGUF-my-lora space.
Refer to the original adapter repository for more details.
Model Description
π©Ί A Medical Reasoning Chatbot Based on Gemma-2B + LoRA
Trained a fine-tuned version of google/gemma-2-2b-it
enhanced with LoRA adapters. It specializes in medical question answering and clinical reasoning using structured, step-by-step thought processes.
π Key Features
- π§ Chain-of-Thought (CoT) Reasoning for complex medical queries
- π§ͺ Fine-tuned on 25,000 samples from
FreedomIntelligence/medical-o1-reasoning-SFT
- 𧬠LoRA-based parameter-efficient tuning using Hugging Face PEFT + TRL
- π‘ Prompt template includes structured
<think>
tags to enhance reasoning clarity - β‘ Lightweight adapter (~10MB) for efficient deployment with the base model
π Intended Use
This model is intended for educational, research, and prototyping purposes in the healthcare and AI domains. It performs best on medical diagnostic and reasoning tasks where step-by-step logical thinking is required.
β οΈ Disclaimer: This model is not intended for real-world clinical use without expert validation. It is a research-grade assistant only.
ποΈ How It Was Trained
- Base Model:
google/gemma-2-2b-it
- LoRA Config:
r=8
,alpha=16
,dropout=0.05
- Frameworks:
transformers
,PEFT
,TRL (SFTTrainer)
- Quantization: 4-bit
nf4
for efficient inference usingbitsandbytes
- Hardware: Trained on Kaggle GPU (T4), optimized for low-resource fine-tuning
π¬ Prompt Format
You are a helpful and knowledgeable AI medical assistant.
### Question:
{medical_question_here}
### Response:
<think>
{step-by-step_reasoning}
</think>
{final_answer}
Use with llama.cpp
# with cli
llama-cli -m base_model.gguf --lora gemma-2b-docjoybot-lora-f16.gguf (...other args)
# with server
llama-server -m base_model.gguf --lora gemma-2b-docjoybot-lora-f16.gguf (...other args)
To know more about LoRA usage with llama.cpp server, refer to the llama.cpp server documentation.
- Downloads last month
- 4
16-bit