onurulu17
/

qwen2.5-vl-3b-instruct-mimic-cxr

Text Generation

vision-language

Model card Files Files and versions

onurulu17 commited on 27 days ago

Commit

164f1d4

·

verified ·

1 Parent(s): 0a6dd8f

Update README.md

Files changed (1) hide show

README.md +26 -27

README.md CHANGED Viewed

@@ -18,33 +18,6 @@ The goal is to adapt a powerful **multimodal vision-language model** for **medic
 ---
-## Model Details
-- **Base model:** Qwen/Qwen2.5-VL-3B-Instruct
-- **Adapter type:** LoRA (PEFT)
-- **Training objective:** Supervised fine-tuning (SFT) on chest X-ray reports
-- **Dataset:** [MIMIC-CXR](https://physionet.org/content/mimic-cxr/2.0.0/) (radiology images + reports)
-- **Languages:** English (medical reporting domain)
-- **Frameworks:** `transformers`, `peft`, `trl`
----
-## Intended Uses
-### Direct Use
-- Generating radiology-style reports from chest X-ray images.
-- Research on applying large multimodal models to medical imaging tasks.
-### Downstream Use
-- Medical text generation tasks where radiological image context is available.
-- Adaptation for other healthcare VQA (Visual Question Answering) tasks.
-### Out-of-Scope Use
-⚠️ **Not for clinical decision-making.**
-This model is intended **for research purposes only**. Do not use it in medical practice without proper validation and regulatory approval.
----
 ## How to Use
 ```python
@@ -99,3 +72,29 @@ sample = [
 output = generate_text_from_sample(model, processor, sample)
 print(output)
 ```

 ---
 ## How to Use
 ```python
 output = generate_text_from_sample(model, processor, sample)
 print(output)
 ```
+---
+## Model Details
+- **Base model:** Qwen/Qwen2.5-VL-3B-Instruct
+- **Adapter type:** LoRA (PEFT)
+- **Training objective:** Supervised fine-tuning (SFT) on chest X-ray reports
+- **Dataset:** [MIMIC-CXR](https://physionet.org/content/mimic-cxr/2.0.0/) (radiology images + reports)
+- **Languages:** English (medical reporting domain)
+- **Frameworks:** `transformers`, `peft`, `trl`
+---
+## Intended Uses
+### Direct Use
+- Generating radiology-style reports from chest X-ray images.
+- Research on applying large multimodal models to medical imaging tasks.
+### Downstream Use
+- Medical text generation tasks where radiological image context is available.
+- Adaptation for other healthcare VQA (Visual Question Answering) tasks.
+### Out-of-Scope Use
+⚠️ **Not for clinical decision-making.**
+This model is intended **for research purposes only**. Do not use it in medical practice without proper validation and regulatory approval.