Multimodal GGUFs
Vision and audio models compatible with llama-server and llama-mtmd-cli.

Gemma 3: Collection • 4 items • Updated 21 days ago • 15
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF: Image-Text-to-Text • Updated May 1 • 869 • 4
InternVL 3 and InternVL 2.5: Collection • 10 items • Updated 21 days ago
Qwen 2 VL and Qwen 2.5 VL: Collection • 4 items • Updated 21 days ago

VAD
Voice Activity Detection (VAD) models for whisper.cpp.

ggml-org/whisper-vad: Updated 22 days ago • 1