Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper • 2505.21497 • Published 13 days ago • 96
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 20 days ago • 129
Shifting AI Efficiency From Model-Centric to Data-Centric Compression Paper • 2505.19147 • Published 15 days ago • 145
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation Paper • 2504.14899 • Published Apr 21 • 21
CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives Paper • 2504.10823 • Published Apr 15 • 14
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published Mar 30 • 95
Running • 317 • Qwen2.5 Omni 7B Demo • Generate text and speech responses from text, images, or audio input
Running • 10 • Deep Reinforcement Learning Leaderboard • Display and search trained RL models on a leaderboard
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 72