Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 17 days ago • 95
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 16 days ago • 351
view article Article HuggingFace, IISc partner to supercharge model building on India's diverse languages 29 days ago • 17
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 24 days ago • 69
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 22 days ago • 86
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 21 days ago • 75
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 211
SimpleRL Collection The collection for the Project "Simple Reinforcement Learning for Reasoning" • 2 items • Updated Feb 19 • 6
CodeI/O Collection Collection for CodeI/O @ https://codei-o.github.io/ • 15 items • Updated Feb 13 • 6
NuminaMath Collection Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 7 items • Updated Feb 10 • 76
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper • 2502.03544 • Published Feb 5 • 43