SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 β’ 10 items β’ Updated Dec 13, 2024 β’ 50
Cultura-Ru-Edu Collection Our dataset for enhancing LLM training with educational content in the Russian language. β’ 2 items β’ Updated Nov 26, 2024 β’ 5
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Paper β’ 1810.04805 β’ Published Oct 11, 2018 β’ 16
view article Article Letβs make a generation of amazing image generation models By burtenshaw β’ Nov 26, 2024 β’ 34
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration Paper β’ 2411.10958 β’ Published Nov 17, 2024 β’ 52
Large Language Models Can Self-Improve in Long-context Reasoning Paper β’ 2411.08147 β’ Published Nov 12, 2024 β’ 63
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper β’ 2410.02089 β’ Published Oct 2, 2024 β’ 12
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization Paper β’ 2410.19609 β’ Published Oct 25, 2024 β’ 17
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper β’ 2410.19168 β’ Published Oct 24, 2024 β’ 19
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper β’ 2410.18533 β’ Published Oct 24, 2024 β’ 42
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models Paper β’ 2410.17637 β’ Published Oct 23, 2024 β’ 34
Addition is All You Need for Energy-efficient Language Models Paper β’ 2410.00907 β’ Published Oct 1, 2024 β’ 145
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models Paper β’ 2410.05229 β’ Published Oct 7, 2024 β’ 22
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts Paper β’ 2410.10626 β’ Published Oct 14, 2024 β’ 38
Textbooks Are All You Need II: phi-1.5 technical report Paper β’ 2309.05463 β’ Published Sep 11, 2023 β’ 87
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper β’ 2410.07985 β’ Published Oct 10, 2024 β’ 28
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content Paper β’ 2410.10783 β’ Published Oct 14, 2024 β’ 26
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Paper β’ 2410.10139 β’ Published Oct 14, 2024 β’ 51