Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published 18 days ago • 118
Focus on Neighbors and Know the Whole: Towards Consistent Dense Multiview Text-to-Image Generator for 3D Creation Paper • 2408.13149 • Published Aug 23, 2024 • 1
1000+ FPS 4D Gaussian Splatting for Dynamic Scene Rendering Paper • 2503.16422 • Published Mar 20 • 14
OminiControl2: Efficient Conditioning for Diffusion Transformers Paper • 2503.08280 • Published Mar 11
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection Paper • 2209.01589 • Published Sep 4, 2022
Image Editing As Programs with Diffusion Models Paper • 2506.04158 • Published Jun 4 • 24 • 2
Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps Paper • 2505.18675 • Published May 24 • 23
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper • 2505.16990 • Published May 22 • 21
Running on Zero 338 338 OminiControl Art 🎨 Transform images into artistic styles like Studio Ghibli