Adaptive Orchestration for Large-Scale Inference on Heterogeneous Accelerator Systems Balancing Cost, Performance, and Resilience Paper • 2503.20074 • Published 3 days ago • 3
view article Article NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets 11 days ago • 30
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 17 days ago • 352
Phi-4 Collection Phi-4 family of small language and multi-modal models. • 7 items • Updated 26 days ago • 112
view article Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 55
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 71
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at https://developers.google.com/health-ai-developer-foundations • 9 items • Updated 2 days ago • 30
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 149