SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass Paper • 2508.15769 • Published 1 day ago • 10
SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Paper • 2505.17012 • Published May 22 • 12
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Paper • 2503.06885 • Published Mar 10 • 4
MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities Paper • 2412.04106 • Published Dec 4, 2024 • 6
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning Paper • 2408.11001 • Published Aug 20, 2024 • 13
MatchTime: Towards Automatic Soccer Game Commentary Generation Paper • 2406.18530 • Published Jun 26, 2024 • 12
Boost Video Frame Interpolation via Motion Adaptation Paper • 2306.13933 • Published Jun 24, 2023 • 3
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models Paper • 2306.00973 • Published Jun 1, 2023 • 3