A collection of EMOVA models (https://emova-ollm.github.io/)

AI & ML interests
Omni-modal Large Language Models, Multi-modal Large Language Models (MLLMs), Emotional spoken dialogue
Organization Card
👋 Welcome to EMOVA! We are a team focused on fully open-source omni-modal foundation models with visual, textual, and speech capabilities. EMOVA (EMotionally Omni-present Voice Assistant) is a novel omni-modal large language model with end-to-end speech capabilities that maintains state-of-the-art vision-language performance. We aim to promote the development of omni-modal human interaction with intelligent models!
Models (13)

Emova-ollm/emova-qwen-2-5-72b-hf • Feature Extraction • 22 • 2
Emova-ollm/emova-qwen-2-5-72b • Text Generation • 10 • 1
Emova-ollm/emova-qwen-2-5-7b-hf • Feature Extraction • 43 • 2
Emova-ollm/emova-qwen-2-5-7b • Text Generation • 18 • 1
Emova-ollm/emova-qwen-2-5-3b-hf • Feature Extraction • 78 • 5
Emova-ollm/emova-qwen-2-5-3b • Text Generation • 181 • 2
Emova-ollm/qwen2vit600m • Feature Extraction • 257
Emova-ollm/Meta-Llama-3.1-8B-Instruct_add_speech_token_4096_nostrip-2 • Feature Extraction • 1
Emova-ollm/Qwen2.5-7B-Instruct_add_speech_token_4096_nostrip • Feature Extraction • 2
Emova-ollm/Qwen2.5-3B-Instruct_add_speech_token_4096_nostrip • Text Generation • 1

Datasets (5)
Emova-ollm/emova-alignment-7m • Viewer • 6.18M • 2.82k • 1
Emova-ollm/emova-sft-speech-eval • Viewer • 3.76k • 101
Emova-ollm/emova-asr-tts-eval • Viewer • 5.24k • 79
Emova-ollm/emova-sft-speech-231k • Viewer • 231k • 316 • 2
Emova-ollm/emova-sft-4m • Viewer • 4.31M • 2.01k • 1