🔗https://arxiv.org/abs/2507.02768 🔗https://github.com/kehanlu/DeSTA2.5-Audio
-
DeSTA-ntu/DeSTA2.5-Audio-Llama-3.1-8B
Audio-Text-to-Text • 0.1B • Updated • 3.31k • 5 -
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
Paper • 2507.02768 • Published • 18 -
DeSTA-ntu/DeSTA-AQA5M-FROM-Llama3.1-8B-Instruct
Viewer • Updated • 3.37M • 239 • 1