140 75 34

Sergio Paniego PRO

sergiopaniego

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

liked a Space 8 days ago

visionLMsftw/VLMVibeEval

updated a Space 8 days ago

visionLMsftw/VLMVibeEval

new activity 8 days ago

visionLMsftw/VLMVibeEval:IU improvement

View all activity

Organizations

sergiopaniego's activity

upvoted an article 9 days ago

Article

CodeAgents + Structure: A Better Way to Execute Actions

and 1 other •

10 days ago

• 43

upvoted a paper 11 days ago

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published 16 days ago • 51

upvoted an article 11 days ago

Article

Interactive Tools for machine learning, deep learning, and math

•

11 days ago

• 40

upvoted a paper 12 days ago

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

Paper • 2505.14640 • Published 17 days ago • 14

upvoted an article 16 days ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others •

17 days ago

• 140

upvoted an article 17 days ago

Article

Microsoft and Hugging Face expand collaboration

and 2 others •

19 days ago

• 20

upvoted an article 21 days ago

Article

TinyAgents: A Minimal Experiment with Code Agents and MCP Tools

•

22 days ago

• 29

upvoted an article 22 days ago

Article

The Transformers Library: standardizing model definitions

and 3 others •

23 days ago

• 112

upvoted a collection 24 days ago

Gemma 3 Object Detection

Collection

3 items • Updated 24 days ago • 3

upvoted an article 25 days ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

26 days ago

• 417

upvoted an article 26 days ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

and 6 others •

27 days ago

• 57

upvoted an article 29 days ago

Article

Page-to-Video: Generate videos from webpages 🪄🎬

•

May 6

• 27

upvoted 4 articles about 1 month ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

and 3 others •

Feb 4

• 158

Article

How to Build an MCP Server with Gradio

and 1 other •

Apr 30

• 162

Article

Welcoming Llama Guard 4 on Hugging Face Hub

and 3 others •

Apr 29

• 37

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 267

upvoted 3 articles about 2 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 37

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

•

Apr 9

• 40

Article

Mixture of Experts Explained

and 5 others •

Dec 11, 2023

• 666

upvoted a paper about 2 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 188