MiniCPM-V 4.5 — a new MLLM for image, multi-image & video understanding that runs even on your phone, released by OpenBMB: openbmb/MiniCPM-V-4_5
✨ SOTA vision-language capability
✨ 96× video token compression → high-FPS & long-video reasoning
✨ Switchable fast vs. deep thinking modes
✨ Strong OCR & document parsing, supporting 30+ languages
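To see why the 96× token compression enables long-video reasoning, here is a back-of-the-envelope sketch. Only the 96× ratio comes from the release notes; the per-frame token count and context budget are hypothetical round numbers for illustration, not MiniCPM-V 4.5 internals.

```python
TOKENS_PER_FRAME = 64    # hypothetical visual tokens per encoded frame
CONTEXT_BUDGET = 32_768  # hypothetical token budget for visual input
COMPRESSION = 96         # compression ratio quoted in the release


def max_frames(budget: int, tokens_per_frame: int, compression: int = 1) -> int:
    """Number of frames that fit in the token budget at a given compression ratio."""
    # Integer math: each frame costs tokens_per_frame / compression tokens on average.
    return budget * compression // tokens_per_frame


uncompressed = max_frames(CONTEXT_BUDGET, TOKENS_PER_FRAME)
compressed = max_frames(CONTEXT_BUDGET, TOKENS_PER_FRAME, COMPRESSION)
print(uncompressed, compressed)  # 512 vs 49152 frames
```

Under these assumed numbers, the same budget goes from 512 frames (about 17 seconds at 30 FPS) to roughly 49k frames (about 27 minutes), which is what makes high-FPS and long-video inputs tractable.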
OpenGVLab's InternVL3_5-2B-MPO [Mixed Preference Optimization (MPO)] is a compact vision-language model in the InternVL3.5 series. You can now try it in the Tiny VLMs Lab, an app featuring 15+ multimodal VLMs ranging from 250M to 4B parameters. These models support tasks such as OCR, reasoning, single-shot question answering, and captioning (including ablated variants) across a broad range of visual categories. They can also handle images with complex, sensitive, or nuanced content while adapting to varying aspect ratios and resolutions.