Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Paper • 2303.14184 • Published Mar 24, 2023
StyleSwin: Transformer-based GAN for High-resolution Image Generation Paper • 2112.10762 • Published Dec 20, 2021
MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding Paper • 2406.04264 • Published Jun 6, 2024 • 2
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Paper • 2407.16655 • Published Jul 23, 2024 • 31
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation Paper • 2405.15619 • Published May 24, 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding Paper • 2403.05525 • Published Mar 8, 2024 • 47
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Paper • 2310.16818 • Published Oct 25, 2023 • 32
Vector Quantized Diffusion Model for Text-to-Image Synthesis Paper • 2111.14822 • Published Nov 29, 2021 • 1
Paint by Example: Exemplar-based Image Editing with Diffusion Models Paper • 2211.13227 • Published Nov 23, 2022 • 3
Pretraining is All You Need for Image-to-Image Translation Paper • 2205.12952 • Published May 25, 2022