Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling Paper • 2507.07982 • Published 3 days ago • 28
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data Paper • 2507.07095 • Published 4 days ago • 50
SingLoRA: Low Rank Adaptation Using a Single Matrix Paper • 2507.05566 • Published 6 days ago • 89
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Paper • 2507.02813 • Published 10 days ago • 56
view article Article Transformers Are Getting Old: Variants and Alternatives Exist! By ProCreations • 8 days ago • 36
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation Paper • 2507.02608 • Published 11 days ago • 20
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper • 2507.01957 • Published 11 days ago • 18
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation Paper • 2506.21416 • Published 17 days ago • 28
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published 23 days ago • 61
Learning to Skip the Middle Layers of Transformers Paper • 2506.21103 • Published 18 days ago • 16
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory Paper • 2506.18903 • Published 20 days ago • 19
Optimizing Multilingual Text-To-Speech with Accents & Emotions Paper • 2506.16310 • Published 25 days ago • 24
view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • 25 days ago • 75
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published 25 days ago • 37
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor Paper • 2506.07932 • Published Jun 9 • 12
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO Paper • 2506.07464 • Published Jun 9 • 12
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models Paper • 2506.07177 • Published Jun 8 • 22
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published Jun 9 • 27