A Multimodal Approach to Device-Directed Speech Detection with Large Language Models Paper • 2403.14438 • Published Mar 21, 2024 • 2
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation Paper • 2403.17694 • Published Mar 26, 2024 • 11
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations Paper • 2308.11466 • Published Aug 22, 2023 • 1