Show-o-RecA / README.md

Improve model card: Add Diffusers library, text-to-image pipeline tag, and HF paper link (#1)

fd21514 verified 11 days ago

2.53 kB

	---
	base_model:
	- showlab/show-o-w-clip-vit
	datasets:
	- brivangl/midjourney-v6-llava
	language:
	- en
	- zh
	license: apache-2.0
	pipeline_tag: text-to-image
	library_name: diffusers
	---

	# Show-o-RecA

	> A self-supervised training framework that aligns understanding and generation in modest compute, with huge zero-shot gain on generation and editing capability.

	This repository hosts the model weights for Show-o-RecA. For installation, usage instructions, and further documentation, please visit Show-o's original [GitHub repository](https://github.com/showlab/Show-o).

	## 🧠 Method

	[![Paper](https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/pdf/2509.07295)
	[![ArXiv](https://img.shields.io/badge/arXiv-A42C25?style=for-the-badge&logo=arxiv&logoColor=white&color=blue)](https://arxiv.org/abs/2509.07295)
	[![Hugging Face Paper](https://img.shields.io/badge/%F0%9F%A4%97_Hugging_Face-Paper-yellow)](https://huggingface.co/papers/2509.07295)
	[![Github](https://img.shields.io/badge/RecA-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/HorizonWind2004/reconstruction-alignment)
	[![Hugging Face Collection](https://img.shields.io/badge/HF_Models-fcd022?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/collections/sanaka87/realign-68ad2176380355a3dcedc068)
	[![HF Demo](https://img.shields.io/badge/Demo_(BAGEL)-fcd022?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/spaces/sanaka87/BAGEL-ReAlign)
	[![Project Page](https://img.shields.io/badge/Project_Page-00CED1?style=for-the-badge&logo=web&logoColor=white)](https://reconstruction-alignment.github.io/)


	## 📊 Benchmarks

	\| Model \| GenEval ↑ \| DPGBench ↑ \| WISE ↑ \|
	\| ------------ \| --------- \| --------- \| --------- \|
	\| Show-o \| 0.57 \| 70.65 \| 0.33 \|
	\| Show-o-RecA \| 0.62 \| 75.70 \| 0.34 \|

	## License

	Show-o-RecA is licensed under the Apache 2.0 license.

	## ✍️ Citation

	If you find our work inspiring or use our codebase in your research, please consider giving a star ⭐ and a citation~

	@misc{xie2025reconstructionalignmentimprovesunified,
	title={Reconstruction Alignment Improves Unified Multimodal Models},
	author={Ji Xie and Trevor Darrell and Luke Zettlemoyer and XuDong Wang},
	year={2025},
	eprint={2509.07295},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2509.07295},
	}