What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations Paper • 2502.08279 • Published Feb 12, 2025
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper • 2505.10610 • Published May 15, 2025
Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs Paper • 2502.05092 • Published Feb 7, 2025
PosterSum: A Multimodal Benchmark for Scientific Poster Summarization Paper • 2502.17540 • Published Feb 24, 2025
MovieSum: An Abstractive Summarization Dataset for Movie Screenplays Paper • 2408.06281 • Published Aug 12, 2024
Select and Summarize: Scene Saliency for Movie Script Summarization Paper • 2404.03561 • Published Apr 4, 2024
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models Paper • 2404.05904 • Published Apr 8, 2024