Submitted by akhaliq 35 Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis · 5 authors
Submitted by akhaliq 32 Chain of Code: Reasoning with a Language Model-Augmented Code Emulator · 10 authors 4
Submitted by akhaliq 26 Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians · 7 authors 3
Submitted by akhaliq 22 MotionCtrl: A Unified and Flexible Motion Controller for Video Generation · 7 authors 2
Submitted by akhaliq 17 HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting · 8 authors
Submitted by akhaliq 11 MagicStick: Controllable Video Editing via Control Handle Transformations · 8 authors 2
Submitted by akhaliq 9 DreamComposer: Controllable 3D Object Generation via Multi-View Conditions · 8 authors
Submitted by akhaliq 7 LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning · 6 authors
Submitted by akhaliq 7 HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces · 8 authors