InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper β’ 2605.14333 β’ Published 29 days ago β’ 35
Running 94 Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks π 94 Evaluate multilingual models using FineTasks
Running on CPU Upgrade 246 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 246 Explore synthetic data benchmarks via an interactive bookshelf
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper β’ 2605.13724 β’ Published about 1 month ago β’ 101
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper β’ 2601.07832 β’ Published Jan 12 β’ 53
Running Featured 1.36k FineWeb: decanting the web for the finest text data at scale π· 1.36k Explore and download the FineWeb webβscale text dataset
Running 3.88k The Ultra-Scale Playbook π 3.88k The ultimate guide to training LLM on large GPU Clusters
Running on CPU Upgrade Featured 3.2k The Smol Training Playbook π 3.2k The secrets to building world-class LLMs