Model Already Knows the Best Noise: Bayesian Active Noise Selection via Attention in Video Diffusion Model Paper • 2505.17561 • Published 15 days ago • 30
Scaling Image and Video Generation via Test-Time Evolutionary Search Paper • 2505.17618 • Published 15 days ago • 39
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published Apr 29 • 31
Running 2.66k 2.66k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published Jan 16 • 14
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published Jan 14 • 35
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14 • 66
EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Paper • 2501.10687 • Published Jan 18 • 14
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper • 2501.12224 • Published Jan 21 • 48
Multi-subject Open-set Personalization in Video Generation Paper • 2501.06187 • Published Jan 10 • 14
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published Jan 16 • 72
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published Jan 13 • 56