Clémentine Fourrier

clefourrier

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face's profile picture Long Range Graph Benchmark's profile picture Evaluation datasets's profile picture BigScience: LMs for Historical Texts's profile picture HuggingFaceBR4's profile picture Huggingface Projects's profile picture Open Graph Benchmark's profile picture HuggingFaceGECLM's profile picture Pretrained Graph Transformers's profile picture Graph Datasets's profile picture BigCode's profile picture Hugging Face H4's profile picture InternLM's profile picture Vectara's profile picture GAIA's profile picture Hugging Face Smol Cluster's profile picture plfe's profile picture Open LLM Leaderboard's profile picture Qwen's profile picture Secure Learning Lab's profile picture Open Life Science AI's profile picture LLM360's profile picture TTS Eval (OLD)'s profile picture hallucinations-leaderboard's profile picture Bias Leaderboard Development's profile picture Leaderboard Organization's profile picture Demo Leaderboard's profile picture Demo leaderboard with an integrated backend's profile picture gg-hf's profile picture AIM-Harvard's profile picture Clinical & Biomedical ML Leaderboards's profile picture Women on Hugging Face's profile picture LMLLO2's profile picture Lighthouz AI's profile picture Open Arabic LLM Leaderboard's profile picture mx-test's profile picture IBM Granite's profile picture HuggingFaceFW's profile picture HF-contamination-detection's profile picture TTS AGI's profile picture Leader Board Test Org's profile picture Social Post Explorers's profile picture hsramall's profile picture Open RL Leaderboard's profile picture The Fin AI's profile picture La Leaderboard's profile picture Open Hebrew LLM's Leaderboard's profile picture gg-tt's profile picture HuggingFaceEval's profile picture HP Inc.'s profile picture Novel Challenge's profile picture Open LLM Leaderboard Archive's profile picture LLHF's profile picture SLLHF's profile picture lbhf's profile picture Inception's profile picture nltpt's profile picture Lighteval testing org's profile picture CléMax's profile picture Hugging Face Science's profile picture test_org's profile picture Coordination Nationale pour l'IA's profile picture LeMaterial's profile picture open-llm-leaderboard-react's profile picture Prompt Leaderboard's profile picture wut?'s profile picture UBC-NLP Collaborations's profile picture smolagents's profile picture Your Bench's profile picture leaderboard explorer's profile picture Open R1's profile picture SIMS's profile picture OpenEvals's profile picture

clefourrier's activity

published an article about 1 month ago
view article
Article

Fixing Open LLM Leaderboard with Math-Verify

By hynky and 3 others
27
published an article about 1 month ago
published an article about 2 months ago
view article
Article

Open-source DeepResearch – Freeing our search agents

By m-ric and 4 others
1.18k
published an article 2 months ago
view article
Article

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

By alozowski and 3 others
21
published an article 4 months ago
view article
Article

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

By alielfilali01 and 4 others
34
published an article 4 months ago
view article
Article

Introduction to the Open Leaderboard for Japanese LLMs

35
published an article 4 months ago
view article
Article

Letting Large Models Debate: The First Multilingual LLM Debate Competition

By xuanricheng and 11 others
30
published an article 4 months ago
view article
Article

Judge Arena: Benchmarking LLMs as Evaluators

By kaikaidai and 7 others
56
published an article 6 months ago
published an article 9 months ago
view article
Article

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

By terryyz and 8 others
46
published an article 10 months ago
view article
Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

By Quent-01 and 9 others
25
published an article 10 months ago
view article
Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

By r34p3r1321 and 15 others
21
published an article 10 months ago
published an article 10 months ago
published an article 11 months ago
view article
Article

Introducing the Open Leaderboard for Hebrew LLMs!

By Shaltiel and 3 others
38
published an article 11 months ago
view article
Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

By mhillsmith and 2 others
13
published an article 11 months ago
view article
Article

Improving Prompt Consistency with Structured Generations

By willkurt and 2 others
62
published an article 11 months ago
view article
Article

Introducing the Open Chain of Thought Leaderboard

By ggbetz and 3 others
30
published an article 11 months ago
view article
Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

By aaditya and 2 others
140
published an article 11 months ago
view article
Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By StringChaos and 6 others
15