Article The Diffusion Principle Understood From The Perspective Of Convolution By refoundd • 2 days ago • 4
Article Timm ❤️ Transformers: Use any timm model with transformers 2 days ago • 24
Article Train 400x faster Static Embedding Models with Sentence Transformers 3 days ago • 98
Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 3 days ago • 32
Visual Document Retrieval Collection A collection of models, datasets, and spaces in the VDR series • 5 items • Updated 8 days ago • 8
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 52
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 9 days ago • 46
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 10 days ago • 40
Cosmos Collection The collection of Cosmos models • 31 items • Updated about 24 hours ago • 233
Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 15 days ago • 30
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 24 days ago • 94
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76
Article Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline: By Omartificial-Intelligence-Space • Nov 30, 2024 • 7