PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper β’ 2504.08791 β’ Published Apr 7 β’ 133
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 32 items β’ Updated May 30 β’ 148
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper β’ 2412.00174 β’ Published Nov 29, 2024 β’ 23
view article Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett β’ Sep 27, 2024 β’ 46
Llama 3.2 3B & 1B GGUF Quants Collection Llama.cpp compatible quants for Llama 3.2 3B and 1B Instruct models. β’ 4 items β’ Updated Sep 26, 2024 β’ 46
Zero-shot Cross-lingual Voice Transfer for TTS Paper β’ 2409.13910 β’ Published Sep 20, 2024 β’ 10
LVCD: Reference-based Lineart Video Colorization with Diffusion Models Paper β’ 2409.12960 β’ Published Sep 19, 2024 β’ 25
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper β’ 2408.06292 β’ Published Aug 12, 2024 β’ 126
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated Dec 6, 2024 β’ 679
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 268
view article Article Welcome Llama 3 - Meta's new open LLM By philschmid and 4 others β’ Apr 18, 2024 β’ 289
Sora Reference Papers Collection A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report β’ openai.com/sora β’ 30 items β’ Updated Oct 3, 2024 β’ 52
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper β’ 2403.03507 β’ Published Mar 6, 2024 β’ 189
Text-to-Image Base Models Collection All text-to-image open source base models, with their respective license β’ 28 items β’ Updated May 10, 2024 β’ 25
SALMONN: Towards Generic Hearing Abilities for Large Language Models Paper β’ 2310.13289 β’ Published Oct 20, 2023 β’ 17
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models Paper β’ 2402.06178 β’ Published Feb 9, 2024 β’ 15
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated Apr 28 β’ 209