Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 643
From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning Paper • 2505.17117 • Published May 21 • 1
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19 • 88
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published Apr 3 • 90
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 290
Komodo: A Linguistic Expedition into Indonesia's Regional Languages Paper • 2403.09362 • Published Mar 14, 2024 • 11
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models Paper • 2401.02333 • Published Jan 4, 2024 • 7
InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining Paper • 2310.07713 • Published Oct 11, 2023 • 3