Multimodal VLLM - a phuong-d-h-nguyen Collection

phuong-d-h-nguyen 's Collections

Fine-tuning LLM

Multimodal VLLM

RAG

LLM

CoT

Multimodal VLLM

updated Jun 22, 2024

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29, 2024 • 53
The (R)Evolution of Multimodal Large Language Models: A Survey

Paper • 2402.12451 • Published Feb 19, 2024
deepseek-ai/deepseek-vl-7b-base

Updated Mar 15, 2024 • 1.04k • 59
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

Paper • 2405.11273 • Published May 18, 2024 • 17
Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131
An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 90