Faster Video Diffusion with Trainable Sparse Attention Paper • 2505.13389 • Published 23 days ago • 35
Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples Paper • 2502.09650 • Published Feb 11
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 27
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation Paper • 2109.06379 • Published Sep 14, 2021
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Paper • 2406.20098 • Published Jun 28, 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1 Paper • 2410.18982 • Published Oct 8, 2024 • 3
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Paper • 2308.16149 • Published Aug 30, 2023 • 28
JuriBERT: A Masked-Language Model Adaptation for French Legal Text Paper • 2110.01485 • Published Oct 4, 2021
Atlas-Chat: Adapting Large Language Models for Low-Resource Moroccan Arabic Dialect Paper • 2409.17912 • Published Sep 26, 2024 • 29
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model Paper • 2304.00869 • Published Apr 3, 2023
Prot2Text: Multimodal Protein's Function Generation with GNNs and Transformers Paper • 2307.14367 • Published Jul 25, 2023 • 3
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text Paper • 2311.09807 • Published Nov 16, 2023 • 1
Pandora: Towards General World Model with Natural Language Actions and Video States Paper • 2406.09455 • Published Jun 12, 2024 • 15
SlimPajama-DC: Understanding Data Combinations for LLM Training Paper • 2309.10818 • Published Sep 19, 2023 • 11