Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1) Paper • 2504.03151 • Published 5 days ago • 7
FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published 6 days ago • 17
Tri$^{2}$-plane: Thinking Head Avatar via Feature Pyramid Paper • 2401.09386 • Published Jan 17, 2024
Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering Paper • 2402.00827 • Published Feb 1, 2024 • 2
Adaptive Super Resolution For One-Shot Talking-Head Generation Paper • 2403.15944 • Published Mar 23, 2024
KinMo: Kinematic-aware Human Motion Understanding and Generation Paper • 2411.15472 • Published Nov 23, 2024
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling Paper • 2501.18898 • Published Jan 31