ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement
Paper
ā¢
2504.01934
ā¢
Published
ā¢
22
Omni-modal Large Language Models, Multi-modal Large Language Models (MLLMs), Emotional spoken dialogue