Federico Minutoli

DiTo97

AI & ML interests

Anything machine learning. I am strongly passionate about computer vision and robotics, and about how machine learning will help achieve autonomous behavior, perception, and continuous learning.

Recent Activity

upvoted a paper 9 days ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 11 days ago • 114

upvoted an article 15 days ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

19 days ago

New activity in LiquidAI/LFM2-VL-450M 4 months ago

ValueError: Image features and image tokens do not match: tokens: 9728, features 10240 mb

#1 opened 4 months ago by

DiTo97

upvoted a paper 6 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

upvoted a paper 10 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 72

liked a dataset 11 months ago

faweigend/wearmocap

Preview • Updated Jan 7 • 247 • 1

upvoted an article about 1 year ago

Article

Deriving DPO's Loss

Dec 24, 2024

upvoted a paper about 1 year ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

New activity in scrapegraphai/AQL-v1-QA about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

upvoted a paper about 1 year ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 36

upvoted a paper over 1 year ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16, 2024 • 101

updated 6 models over 1 year ago

updated a dataset over 1 year ago

scrapegraphai/AQL-v1-QA

Viewer • Updated Jun 25, 2024 • 8.76k • 43

upvoted an article over 1 year ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024 • 137

upvoted a paper over 1 year ago

LEGENT: Open Platform for Embodied Agents

Paper • 2404.18243 • Published Apr 28, 2024 • 22
