Sergio Paniego's picture

Sergio Paniego PRO

sergiopaniego

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

google/gemma-3n-E4B-it:Audio example

updated a Space 1 day ago

sergiopaniego/vlm_object_understanding

new activity 1 day ago

huggingface-projects/gemma-3n-E4B-it:Add video example

View all activity

Organizations

liked a model 1 day ago

vikhyatk/moondream2

Image-Text-to-Text • 2B • Updated 4 days ago • 307k • 1.18k

liked a Space 2 days ago

Gemma 3n E4B It

Describe images, videos, and audio

liked a Space 3 days ago

LightGlue

LightGlue demo

liked 2 datasets 3 days ago

merve/vlm_test_images

Viewer • Updated 2 days ago • 15 • 333 • 2

lmms-lab/Video-MME

Viewer • Updated Jul 4, 2024 • 2.7k • 12.1k • 47

liked a dataset 10 days ago

ariG23498/coco2017

Viewer • Updated 3 days ago • 122k • 178 • 3

liked 2 Spaces 12 days ago

VisionZip

EfficientVLM

CVPR2025

Search and filter CVPR 2025 papers

liked a dataset 19 days ago

nvidia/Nemotron-Personas

Viewer • Updated 19 days ago • 100k • 18.5k • 150

liked a Space about 1 month ago

comparevlms

Compare vision language models

liked a model about 1 month ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated about 1 month ago • 173k • • 2.13k

liked a Space about 1 month ago

Gemma3 License Plate Detection

Gemma 3 for license plate detection

liked a dataset about 1 month ago

TIGER-Lab/VideoEval-Pro

Viewer • Updated 30 days ago • 1.29k • 398 • 3

liked 2 Spaces about 1 month ago

VideoEval-Pro Leaderboard

A more robust benchmark for long video understanding.

Nanovlm

A space for nanoVLM model

liked a model about 1 month ago

multimodalart/isometric-skeumorphic-3d-bnb

Text-to-Image • Updated May 15 • 1.94k • • 331

liked a model about 2 months ago

ariG23498/gemma-3-4b-pt-object-detection

Object Detection • 4B • Updated 6 days ago • 1.81k • 4

liked a Space about 2 months ago

Qwen Od

Detect objects in images from URL or upload

liked 2 models about 2 months ago

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 2.21k • 89

Freepik/F-Lite-Texture

Text-to-Image • Updated Apr 29 • 79 • 22