Scaling RL to Long Videos
Efficient-Large-Model
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 90 β’ β’ 22 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 13 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 584 β’ β’ 1
A series of VILA models that specialize for **long-context** abilities
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 66.9k β’ 21 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 622 β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 6.86k β’ 2 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 215 β’ 1
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
412
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 40 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 45 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 15 β’ 4
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 398 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 275 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 64 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 194
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 1.41k β’ 36 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 3.95k β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 43.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 25 β’ 5
Scaling RL to Long Videos
-
Efficient-Large-Model/NVILA-15B
Text Generation β’ Updated β’ 66.9k β’ 21 -
Efficient-Large-Model/NVILA-Lite-15B
Text Generation β’ Updated β’ 622 β’ 4 -
Efficient-Large-Model/NVILA-Lite-8B
Text Generation β’ Updated β’ 6.86k β’ 2 -
Efficient-Large-Model/NVILA-Lite-8B-stage2
Text Generation β’ Updated β’ 215 β’ 1
SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
-
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper β’ 2501.18427 β’ Published β’ 21 -
Efficient-Large-Model/SANA1.5_4.8B_1024px
Text-to-Image β’ Updated β’ 90 β’ β’ 22 -
Efficient-Large-Model/SANA1.5_4.8B_1024px_diffusers
Text-to-Image β’ Updated β’ β’ 13 -
Efficient-Large-Model/SANA1.5_1.6B_1024px
Text-to-Image β’ Updated β’ 584 β’ β’ 1
πSANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
-
412
SanaSprint
πUltra fast high quality image generation
-
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Paper β’ 2503.09641 β’ Published β’ 40 -
Efficient-Large-Model/Sana_Sprint_1.6B_1024px
Text-to-Image β’ Updated β’ 45 β’ 15 -
Efficient-Large-Model/Sana_Sprint_0.6B_1024px
Text-to-Image β’ Updated β’ 15 β’ 4
β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
-
Efficient-Large-Model/Sana_1600M_1024px
Text-to-Image β’ Updated β’ 398 β’ β’ 213 -
Efficient-Large-Model/Sana_1600M_1024px_BF16
Text-to-Image β’ Updated β’ 275 β’ 13 -
Efficient-Large-Model/Sana_1600M_1024px_BF16_ControlNet_HED
Text-to-Image β’ Updated β’ 64 -
Efficient-Large-Model/Sana_600M_1024px_ControlNet_HED
Text-to-Image β’ Updated β’ 194
A series of VILA models that specialize for **long-context** abilities
-
Efficient-Large-Model/Llama-3-VILA1.5-8B
Text Generation β’ Updated β’ 1.41k β’ 36 -
Efficient-Large-Model/VILA1.5-40b
Text Generation β’ Updated β’ 3.95k β’ 17 -
Efficient-Large-Model/VILA1.5-3b
Text Generation β’ Updated β’ 43.9k β’ 30 -
Efficient-Large-Model/VILA1.5-3b-AWQ
Text Generation β’ Updated β’ 25 β’ 5