Leandro von Werra

lvwerra

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

upvoted a paper about 16 hours ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

upvoted an article 6 days ago

CodeAgents + Structure: A Better Way to Execute Actions

updated a Space 15 days ago

data-agents/jupyter-agent

lvwerra's activity

liked a Space 15 days ago

WikiRacing Language Models

Find answers by racing against LLM in a quiz game

liked a Space 22 days ago

Sheets

Create a dataset

liked a model 28 days ago

ServiceNow-AI/Apriel-Nemotron-15b-Thinker

Text Generation • Updated 20 days ago • 2.42k • 87

liked a Space 29 days ago

Computer Agent

Interact with an agent to perform web-based tasks

liked a model about 1 month ago

Qwen/Qwen3-235B-A22B

Text Generation • Updated 14 days ago • 186k • 922

liked a Space about 1 month ago

Dia 1.6B

Generate realistic dialogue from a script, using Dia!

liked a model about 1 month ago

lldacing/flash-attention-windows-wheel

Updated 4 days ago • 158

liked 2 models about 2 months ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated 19 days ago • 302k • 1.41k

rasbt/llama-3.2-from-scratch

Updated Apr 16 • 276

liked a Space 2 months ago

Try YourBench!

Generate a custom benchmark from any document

liked 3 Spaces 3 months ago

QwQ 32B Demo

Send text and get detailed responses

Open LLM Progress Tracker

Visualize Open vs. Proprietary LLM Progress

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a Space 4 months ago

DABstep Leaderboard

DABstep Reasoning Benchmark Leaderboard

liked a model 4 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 684k • 12.3k

liked a Space 5 months ago

Jupyter Agent

Generate code solutions interactively

liked a Space 6 months ago

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

liked 2 datasets 6 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 17 • 34

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 39.2k • 176

liked a Space 7 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks