view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others โข Mar 12 โข 429
view article Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen โข Jan 15 โข 187
Running 2.67k 2.67k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
view article Article Introducing the Open Arabic LLM Leaderboard By alielfilali01 and 4 others โข May 14, 2024 โข 92
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Text Generation โข Updated about 1 month ago โข 38.8k โข 270
view article Article ๐ณ๏ธ Attention Sinks in LLMs for endless fluency By tomaarsen โข Oct 9, 2023 โข 10
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐ Collection Papers about vision-language models, most important ones are on top of the list. โข 27 items โข Updated Apr 30, 2024 โข 36
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Paper โข 2403.05530 โข Published Mar 8, 2024 โข 65
view post Post ๐ I'm SamI use ML and HPC to accelerate scientific discovery @ Argonne National Laboratory*https://samforeman.mehttps://twitter.com/saforem2https://github.com/saforem2* https://alcf.anl.gov/about/people/sam-foreman ๐ค 5 5 + Reply