78 85 178

Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

liked a model about 12 hours ago

HuggingFaceTB/SmolLM2-135M-Instruct

updated a model about 15 hours ago

andito/nanoVLM

liked a Space about 18 hours ago

webml-community/conversational-webgpu

View all activity

Organizations

andito's activity

upvoted 3 articles 2 days ago

Article

Daily Robotics June #1 - SmolVLA discovery and thoughts

•

3 days ago

• 9

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

and 1 other •

3 days ago

• 60

Article

KV Cache from scratch in nanoVLM

and 4 others •

3 days ago

• 55

upvoted an article 3 days ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

and 8 others •

4 days ago

• 94

upvoted a paper 4 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published 4 days ago • 74

upvoted an article 9 days ago

Article

CodeAgents + Structure: A Better Way to Execute Actions

and 1 other •

10 days ago

• 43

upvoted 2 articles 16 days ago

Article

Exploring Quantization Backends in Diffusers

and 2 others •

17 days ago

• 32

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

and 6 others •

17 days ago

• 140

upvoted an article 17 days ago

Article

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

and 5 others •

17 days ago

• 26

upvoted an article 23 days ago

Article

Highlights from the First ICLR 2025 Watermarking Workshop

and 4 others •

23 days ago

• 11

upvoted an article 25 days ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

26 days ago

• 417

upvoted an article 26 days ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

and 6 others •

27 days ago

• 57

upvoted a paper 26 days ago

UniVLA: Learning to Act Anywhere with Task-centric Latent Actions

Paper • 2505.06111 • Published 28 days ago • 24

upvoted 2 articles about 1 month ago

Article

Welcoming Llama Guard 4 on Hugging Face Hub

and 3 others •

Apr 29

• 37

Article

Cohere on Hugging Face Inference Providers 🔥

and 6 others •

Apr 16

• 126

upvoted 3 articles about 2 months ago

Article

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

•

Apr 9

• 40

Article

Advancing European AI Sovereignty Through Racine.ai Flantier Open-Source Multimodal Models

•

Mar 26

• 10

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

and 1 other •

Mar 21

• 33

upvoted 2 papers about 2 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 285

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 105