Leandro von Werra

lvwerra

huggingface

https://www.lvwerra.com

AI & ML interests

NLP and RL

Recent Activity

liked a model 12 days ago

nvidia/nemotron-ocr-v1

upvoted an article 16 days ago

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

liked a Space 17 days ago

lvwerra/jagged-data-frontier

published an article 2 months ago

Article

Unlock the power of images with AI Sheets

+4

Oct 21, 2025

•

33

published an article 4 months ago

Article

Jupyter Agents: training LLMs to reason with notebooks

+1

Sep 10, 2025

•

58

published an article 5 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

+4

Aug 8, 2025

•

106

published an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

Jul 8, 2025

•

741

published an article 9 months ago

Article

Open R1: Update #4

Mar 26, 2025

•

48

published an article 10 months ago

Article

Open R1: Update #3

Mar 11, 2025

•

296

published an article 11 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

+4

Feb 4, 2025

•

121

published an article 11 months ago

Article

Open-R1: Update #1

Feb 2, 2025

•

305

published an article 11 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

+1

Jan 28, 2025

•

887

published an article about 1 year ago

Article

LeMaterial: an open source initiative to accelerate materials discovery and research

+8

Dec 10, 2024

•

54

published an article about 1 year ago

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

+2

Oct 23, 2024

•

18

published an article over 1 year ago

Article

FineVideo: behind the scenes

+4

Sep 23, 2024

•

35

published an article over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

Sep 18, 2024

•

272

published an article over 1 year ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

+1

Aug 14, 2024

•

73

published an article over 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

+6

Jul 23, 2024

•

241

published an article over 1 year ago

Article

BigCodeBench: The Next Generation of HumanEval

+7

Jun 18, 2024

•

52

published an article over 1 year ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

+7

Apr 29, 2024

•

79

published an article over 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

+3

Apr 18, 2024

•

295