EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27, 2024 • 195
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 61
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration Paper • 2306.09093 • Published Jun 15, 2023 • 15