arxiv:2501.16764

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Published on Jan 28
· Submitted by paulpanwang on Jan 29

Abstract

Recent advancements in 3D content generation from text or a single image struggle with limited high-quality 3D datasets and inconsistency from 2D multi-view generation. We introduce DiffSplat, a novel 3D generative framework that natively generates 3D Gaussian splats by taming large-scale text-to-image diffusion models. It differs from previous 3D generative models by effectively utilizing web-scale 2D priors while maintaining 3D consistency in a unified model. To bootstrap the training, a lightweight reconstruction model is proposed to instantly produce multi-view Gaussian splat grids for scalable dataset curation. In conjunction with the regular diffusion loss on these grids, a 3D rendering loss is introduced to facilitate 3D coherence across arbitrary views. The compatibility with image diffusion models enables seamless adaptation of numerous image generation techniques to the 3D realm. Extensive experiments reveal the superiority of DiffSplat in text- and image-conditioned generation tasks and downstream applications. Thorough ablation studies validate the efficacy of each critical design choice and provide insights into the underlying mechanism.
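
As a rough illustration of how a diffusion loss and a rendering loss might be combined into one training objective, here is a minimal sketch in plain Python. It is not the authors' implementation: `toy_render` is a hypothetical stand-in for a differentiable Gaussian splat renderer, `lambda_render` is an assumed weighting term, and the choice of mean squared error for both terms is an assumption made for simplicity.

```python
def mse(a, b):
    """Mean squared error between two equal-length sequences of floats."""
    assert len(a) == len(b) and len(a) > 0
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def toy_render(splat_grid, view_scale):
    """Hypothetical renderer stand-in: scales every splat value by a per-view factor.

    A real system would rasterize 3D Gaussians from a camera pose instead.
    """
    return [view_scale * v for v in splat_grid]


def combined_loss(pred_noise, true_noise, pred_grid, target_grid, views,
                  lambda_render=1.0):
    """Diffusion loss on the splat grid plus a rendering loss averaged over views.

    lambda_render is an assumed weight, not a value taken from the paper.
    """
    # Denoising term: compare predicted noise with the noise that was added.
    diffusion = mse(pred_noise, true_noise)
    # Rendering term: compare renders of predicted and target splats per view.
    rendering = sum(
        mse(toy_render(pred_grid, v), toy_render(target_grid, v)) for v in views
    ) / len(views)
    return diffusion + lambda_render * rendering
```

The rendering term is averaged over several views so that an error visible from only one viewpoint still contributes to the objective, which is the intuition behind encouraging coherence across arbitrary views.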

Community

DiffSplat is a generative framework to synthesize 3D Gaussian Splats from text prompts & single-view images in ⚡️ 1~2 seconds. It is fine-tuned directly from a pre-trained text-to-image diffusion model.

Project: https://chenguolin.github.io/projects/DiffSplat/
Code: https://github.com/chenguolin/DiffSplat
The code and pre-trained checkpoints are officially released! @akhaliq 🎉🎉🎉
Dive in and explore—feedback is always welcome! 🙌
