HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling Paper • 2506.20452 • Published 3 days ago • 11
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published 3 days ago • 39
AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models Paper • 2506.19851 • Published 3 days ago • 50
Light of Normals: Unified Feature Representation for Universal Photometric Stereo Paper • 2506.18882 • Published 4 days ago • 80
Align Your Flow: Scaling Continuous-Time Flow Map Distillation Paper • 2506.14603 • Published 10 days ago • 18
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 11 days ago • 240
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation Paper • 2506.11924 • Published 14 days ago • 32
Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper • 2506.08279 • Published 18 days ago • 26
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper • 2506.09790 • Published 16 days ago • 51
SpatialLM: Training Large Language Models for Structured Indoor Modeling Paper • 2506.07491 • Published 19 days ago • 38
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion Paper • 2506.08009 • Published 18 days ago • 22
Aligning Text, Images, and 3D Structure Token-by-Token Paper • 2506.08002 • Published 18 days ago • 18
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper • 2506.05010 • Published 23 days ago • 68
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model Paper • 2505.21179 • Published May 27 • 11
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Paper • 2505.23716 • Published 29 days ago • 31