Submitted by akhaliq 39 VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models · 3 authors 3
Submitted by akhaliq 33 The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning · 8 authors 4
Submitted by akhaliq 23 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence · 10 authors 5
Submitted by akhaliq 18 LivePhoto: Real Image Animation with Text-guided Motion Control · 7 authors 3
Submitted by akhaliq 15 Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models · 5 authors
Submitted by akhaliq 15 GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis · 7 authors 1
Submitted by akhaliq 14 LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models · 11 authors
Submitted by akhaliq 13 Fine-grained Controllable Video Generation via Object Appearance and Context · 7 authors
Submitted by akhaliq 11 StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D · 10 authors 3
Submitted by akhaliq 11 Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models · 6 authors 2
Submitted by akhaliq 10 GPT4Point: A Unified Framework for Point-Language Understanding and Generation · 8 authors
Submitted by akhaliq 9 VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams · 8 authors 3
Submitted by akhaliq 6 Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training · 9 authors 1
Submitted by akhaliq 6 Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments · 16 authors 1
Submitted by akhaliq 6 TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents · 6 authors 1