view article Article Supercharge Edge AI With HighβAccuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others β’ Aug 18 β’ 29
NVILA: Efficient Frontier Visual Language Models Paper β’ 2412.04468 β’ Published Dec 5, 2024 β’ 59
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper β’ 2409.17481 β’ Published Sep 26, 2024 β’ 47
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Paper β’ 2409.17481 β’ Published Sep 26, 2024 β’ 47
LLM Pruning and Distillation in Practice: The Minitron Approach Paper β’ 2408.11796 β’ Published Aug 21, 2024 β’ 57
nvidia/Mistral-NeMo-Minitron-8B-Base Text Generation β’ 8B β’ Updated Aug 22, 2024 β’ 3.94k β’ 177
LLM Pruning and Distillation in Practice: The Minitron Approach Paper β’ 2408.11796 β’ Published Aug 21, 2024 β’ 57