Leandro von Werra

lvwerra

AI & ML interests

NLP and RL

Recent Activity

updated a dataset 13 days ago
lvwerra/admin

Organizations

Hugging Face, Natural Language Processing with Transformers, BigScience Workshop, Spaces-explorers, Hugging Face Course, BigScience Catalogue Data, PubMed Central, BigScience Data, trl internal testing, evaluate, Data Days Zurich, Evaluate Comparison, Evaluate Metric, HuggingFaceM4, Evaluate Measurement, scikit-learn, TRL, CodeParrot, BigCode, CompVis, Hugging Face H4, Hugging Face OSS Metrics, BigBang, transfer-test-target, CompVis Community, Sphere Fall 2022, BigCode Data, Stack Overflow, Reading Group, Hugging Face Extreme-Scale, Need4Speed, Code Llama, Personal Coding Assistant, Hugging Face Smol Models Research, Hugging Face Smol Cluster, Open LLM Leaderboard, gg-hf, Nanotron Research, Hugging Face SMOL, HuggingFaceFW, bigcode nvidia, hsramall, mlo-data-cleaning, HuggingFaceFW-Dev, StarCoder2 Data, Data Agents, CinePile collaboration, Hugging Face FineVideo, smol-explorers, swissai-hf-data, abcd4321, Hugging Face Science, eggs, LeMaterial, Your Bench, Open R1, Open Agents

lvwerra's activity

published an article about 2 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

By eggie5 and 5 others
67
published an article about 2 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

By eliebak and 2 others
822
published an article 4 months ago

LeMaterial: an open source initiative to accelerate materials discovery and research

By AlexDuvalinho and 9 others
43
published an article 5 months ago

CinePile 2.0 - making stronger datasets with adversarial refinement

By mfarre and 3 others
15
published an article 6 months ago

FineVideo: behind the scenes

By mfarre and 5 others
30
published an article 6 months ago

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

By medmekk and 5 others
227
published an article 7 months ago

A failed experiment: Infini-Attention, and why we should keep trying?

By neuralink and 2 others
60
published an article 8 months ago

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By philschmid and 7 others
231
published an article 9 months ago

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

By terryyz and 8 others
46
published an article 11 months ago

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

By yuxiang630 and 8 others
76
published an article about 1 year ago

Constitutional AI with Open LLMs

By vwxyzjn and 6 others
13
published an article about 1 year ago

Preference Tuning LLMs with Direct Preference Optimization Methods

By kashif and 4 others
50
published an article over 1 year ago

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

By lewtun and 6 others
12
published an article over 1 year ago

The N Implementation Details of RLHF with PPO

By vwxyzjn and 2 others
45