---
base_model:
- showlab/show-o-w-clip-vit
datasets:
- brivangl/midjourney-v6-llava
language:
- en
- zh
license: apache-2.0
pipeline_tag: text-to-image
library_name: diffusers
---
# Show-o-RecA
> A self-supervised training framework that aligns understanding and generation with modest compute, yielding large **zero-shot** gains in generation and editing capability.
This repository hosts the model weights for **Show-o-RecA**. For installation, usage instructions, and further documentation, please visit Show-o's original [GitHub repository](https://github.com/showlab/Show-o).
## 🧠 Method
[![Paper](https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/pdf/2509.07295)
[![ArXiv](https://img.shields.io/badge/arXiv-A42C25?style=for-the-badge&logo=arxiv&logoColor=white&color=blue)](https://arxiv.org/abs/2509.07295)
[![Hugging Face Paper](https://img.shields.io/badge/%F0%9F%A4%97_Hugging_Face-Paper-yellow)](https://huggingface.co/papers/2509.07295)
[![Github](https://img.shields.io/badge/RecA-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/HorizonWind2004/reconstruction-alignment)
[![Hugging Face Collection](https://img.shields.io/badge/HF_Models-fcd022?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/collections/sanaka87/realign-68ad2176380355a3dcedc068)
[![HF Demo](https://img.shields.io/badge/Demo_(BAGEL)-fcd022?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/spaces/sanaka87/BAGEL-ReAlign)
[![Project Page](https://img.shields.io/badge/Project_Page-00CED1?style=for-the-badge&logo=web&logoColor=white)](https://reconstruction-alignment.github.io/)
## 📊 Benchmarks
| Model | GenEval ↑ | DPGBench ↑ | WISE ↑ |
| ------------ | --------- | --------- | --------- |
| **Show-o** | 0.57 | 70.65 | 0.33 |
| **Show-o-RecA** | **0.62** | **75.70** | **0.34** |
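
To make the size of these improvements easier to read, the following minimal sketch computes the absolute and relative gain of Show-o-RecA over Show-o for each benchmark. The scores are copied from the table above; the names `scores`, `gains`, and `summary` are illustrative and not part of any Show-o or RecA codebase.

```python
# Scores copied from the benchmark table (higher is better):
# (Show-o baseline, Show-o-RecA)
scores = {
    "GenEval": (0.57, 0.62),
    "DPGBench": (70.65, 75.70),
    "WISE": (0.33, 0.34),
}


def gains(base, reca):
    """Return (absolute gain, relative gain in percent) of reca over base."""
    absolute = reca - base
    return absolute, 100.0 * absolute / base


# Hypothetical helper structure: benchmark name -> (absolute, relative %)
summary = {name: gains(base, reca) for name, (base, reca) in scores.items()}

for name, (absolute, relative) in summary.items():
    print(f"{name}: +{absolute:.2f} absolute, +{relative:.1f}% relative")
```

Note that the three benchmarks use different scales (GenEval and WISE are reported in [0, 1], DPGBench out of 100), so the relative gain is the more comparable figure across columns.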
## License
Show-o-RecA is licensed under the Apache 2.0 license.
## ✍️ Citation
If you find our work inspiring or use our codebase in your research, please consider giving the repository a star ⭐ and citing the paper:
@misc{xie2025reconstructionalignmentimprovesunified,
title={Reconstruction Alignment Improves Unified Multimodal Models},
author={Ji Xie and Trevor Darrell and Luke Zettlemoyer and XuDong Wang},
year={2025},
eprint={2509.07295},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.07295},
}