Enhancing Speech Emotion Recognition with Graph-Based Multimodal Fusion and Prosodic Features for the Speech Emotion Recognition in Naturalistic Conditions Challenge at Interspeech 2025 Paper • 2506.02088 • Published Jun 2025
FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion Paper • 2501.05586 • Published Jan 9, 2025
FairPIVARA: Reducing and Assessing Biases in CLIP-Based Multimodal Models Paper • 2409.19474 • Published Sep 28, 2024
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge Paper • 2207.14418 • Published Jul 29, 2022
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages Paper • 2310.13683 • Published Oct 20, 2023