Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Guilherme34
/
Samantha-omni
like
2
Any-to-Any
Transformers
Safetensors
openbmb/RLAIF-V-Dataset
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
arxiv:
2408.01800
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
040782e
Samantha-omni
70.6 MB
1 contributor
History:
23 commits
Guilherme34
Upload configuration_minicpm.py with huggingface_hub
040782e
verified
about 2 months ago
assets
Upload assets/input_examples/indian-accent.wav with huggingface_hub
about 2 months ago
.gitattributes
Safe
1.9 kB
Upload assets/input_examples/audio_understanding.mp3 with huggingface_hub
about 2 months ago
README.md
Safe
50.4 kB
Upload README.md with huggingface_hub
about 2 months ago
added_tokens.json
Safe
1.41 kB
Upload added_tokens.json with huggingface_hub
about 2 months ago
config.json
Safe
3.44 kB
Upload config.json with huggingface_hub
about 2 months ago
configuration_minicpm.py
Safe
7.55 kB
Upload configuration_minicpm.py with huggingface_hub
about 2 months ago