Our advanced models and datasets for exploring the frontiers of MT
AI & ML interests
None defined yet.
Recent Activity
Papers
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application
An unified model for multimodal understanding, text-to-image generation, and image editing.
With 29B parameters, Ovis1.6-Gemma2-27B achieves exceptional performance in the OpenCompass benchmark, ranking among the top-tier open-source MLLMs.
-
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text β’ 29B β’ Updated β’ 109 β’ 63 -
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text β’ 10B β’ Updated β’ 333 β’ 276 -
AIDC-AI/Ovis1.6-Gemma2-9B-GPTQ-Int4
Image-Text-to-Text β’ Updated β’ 111 β’ 9 -
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text β’ 4B β’ Updated β’ 206 β’ 49
Our advanced models and datasets for exploring the frontiers of MT
Our next-generation MLLMs for native-resolution vision and advanced reasoning
An unified model for multimodal understanding, text-to-image generation, and image editing.
Our latest advancement in multi-modal large language models (MLLMs)
With 29B parameters, Ovis1.6-Gemma2-27B achieves exceptional performance in the OpenCompass benchmark, ranking among the top-tier open-source MLLMs.
-
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text β’ 29B β’ Updated β’ 109 β’ 63 -
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text β’ 10B β’ Updated β’ 333 β’ 276 -
AIDC-AI/Ovis1.6-Gemma2-9B-GPTQ-Int4
Image-Text-to-Text β’ Updated β’ 111 β’ 9 -
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text β’ 4B β’ Updated β’ 206 β’ 49
Ovis1.5 is fully open-source: we release training datasets, training & inference codes, and model weights.