view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 14 days ago โข 61
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw โข 23 days ago โข 23
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations โข 3 items โข Updated Dec 13, 2024 โข 21
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 โข 1 item โข Updated Dec 6, 2024 โข 121
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinctโข MI250 GPUs based on OLMo. โข 4 items โข Updated Oct 31, 2024 โข 18
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 โข 9 items โข Updated Nov 27, 2024 โข 104
view article Article Optimum-NVIDIA - Unlock blazingly fast LLM inference in just 1 line of code Dec 5, 2023 โข 4