VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published Jan 22 • 91
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding Paper • 2311.16922 • Published Nov 28, 2023 • 1