Nemotron-H Collection Mamba-Transformer hybrid models β’ 5 items β’ Updated about 20 hours ago β’ 9
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). β’ 12 items β’ Updated about 16 hours ago β’ 16
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory β’ 8 items β’ Updated 12 days ago β’ 116
SVDQuant Collection Models and datasets for "SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models" β’ 20 items β’ Updated 29 days ago β’ 22
distil-large-v3.5 Collection This collection contains the model repositories for distil-large-v3.5, which provides support for the most popular Whisper libraries. β’ 5 items β’ Updated 21 days ago β’ 7
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper β’ 2503.20785 β’ Published 20 days ago β’ 21
ConsisID Collection Identity-Preserving Text-to-Video Generation by Frequency Decomposition β’ 4 items β’ Updated Dec 3, 2024 β’ 12
Enabling Versatile Controls for Video Diffusion Models Paper β’ 2503.16983 β’ Published 25 days ago β’ 14
SANA-Sprint Collection πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation β’ 6 items β’ Updated 9 days ago β’ 33
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper β’ 2503.09641 β’ Published Mar 12 β’ 36
view article Article Welcome PaliGemma 2 β New vision language models by Google Dec 5, 2024 β’ 151