Lingshu: MLLMs for Unified Multimodal Medical Understanding and Reasoning

Welcome to the Lingshu project: a generalist foundation model for unified multimodal medical understanding and reasoning.

Highlights:

  • Lingshu supports more than 12 medical imaging modalities, including X-Ray, CT Scan, MRI, Microscopy, Ultrasound, Histopathology, Dermoscopy, Fundus, OCT, Digital Photography, Endoscopy, and PET.
  • Lingshu models achieve state-of-the-art (SOTA) results on most medical multimodal and textual QA and report generation tasks at the 7B and 32B model sizes.
  • Lingshu-32B outperforms GPT-4.1 and Claude Sonnet 4 in most multimodal QA and report generation tasks.
