Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 39
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 27
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare By aaditya and 2 others • Apr 19, 2024 • 164
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 22 days ago • 114
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 21 days ago • 86
Running on CPU Upgrade • 65 • Leaderboard LLM FR: Track, rank and evaluate open LLMs and chatbots in French
Running • 962 • FineWeb: decanting the web for the finest text data at scale. Generate high-quality web text data for LLM training
Article How biased is Whisper? Evaluating Whisper Models for Robustness to Diverse English Accents By Steveeeeeeen • Jan 29 • 17