Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Lingshu: MLLMs for Unified Multimodal Medical Understanding and Reasoning
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
Welcome to the Lingshu project - A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning.
Highlights:
- Lingshu supports more than 12 medical imaging modalities, including X-Ray, CT Scan, MRI, Microscopy, Ultrasound, Histopathology, Dermoscopy, Fundus, OCT, Digital Photography, Endoscopy, and PET.
- Lingshu models achieve SOTA on most medical multimodal/textual QA and report generation tasks for 7B and 32 model sizes.
- Lingshu-32B outperforms GPT-4.1 and Claude Sonnet 4 in most multimodal QA and report generation tasks.
Quick links:
- Models are available in multiple model sizes: Lingshu-7B, Lingshu-32B
- Paper: Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Collections
1
models
2
datasets
0
None public yet