Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

  • This model belongs to the family of official Lotus models.
  • Compared to the previous version, this model is trained in disparity space (inverse depth), achieving better performance and more stable video depth estimation.

Paper Paper HuggingFace Demo GitHub

Developed by: Jing He✱, Haodong Li✱, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen✉

teaser teaser

Usage

Please refer to this page.

Downloads last month
1,071
Inference API
Inference API (serverless) does not yet support diffusers models for this pipeline type.