prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-RL-AIO-GGUF Video-Text-to-Text • 8B • Updated 23 minutes ago • 1
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-RL-AIO-GGUF Video-Text-to-Text • 8B • Updated 23 minutes ago • 1
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-RL-AIO-GGUF Video-Text-to-Text • 8B • Updated 23 minutes ago • 1
Camel-Doc-OCR-080125 Collection Document Retrieval, Content Extraction, and Analysis Recognition. • 4 items • Updated about 1 hour ago • 2
view reply @nroggendorff I’ve removed the image comparator (Rerun component) from the Hugging Face demo, as users found it difficult to download the resulting image. However, it is still available in the GitHub.😊
Running on Zero MCP Featured 78 Qwen-Image-Edit-2511-LoRAs-Fast 🎃 78 Demo of the Collection of Qwen Image Edit LoRAs
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published 4 days ago • 34
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 2 days ago • 59
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-SFT-GGUF-AIO-GGUF Video-Text-to-Text • 8B • Updated 1 day ago • 112 • 1
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-SFT-GGUF-AIO-GGUF Video-Text-to-Text • 8B • Updated 1 day ago • 112 • 1
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-SFT-GGUF-AIO-GGUF Video-Text-to-Text • 8B • Updated 1 day ago • 112 • 1
prithivMLmods/KAIROS-MM-Qwen2.5-VL-7B-SFT-GGUF-AIO-GGUF Video-Text-to-Text • 8B • Updated 1 day ago • 112 • 1