-
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Paper • 2412.04626 • Published • 14 -
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI
Paper • 2411.14522 • Published • 34 -
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Paper • 2411.03823 • Published • 45 -
Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data
Paper • 2410.18558 • Published • 19
Jialiang Cheng
Julius-L
·
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
20 days ago
Deepseek Papers
updated
a collection
about 2 months ago
multimodal dataset
updated
a collection
about 2 months ago
multimodal dataset
Organizations
None yet
Collections
12
models
None public yet
datasets
None public yet