Running 988 988 FineWeb: decanting the web for the finest text data at scale π· Generate high-quality web text data for LLM training
DocLLM: A layout-aware generative language model for multimodal document understanding Paper β’ 2401.00908 β’ Published Dec 31, 2023 β’ 189
Restarting on CPU Upgrade 562 562 Open Ko-LLM Leaderboard π Explore and filter language model benchmark results