Improve model card: Update metadata, paper link, and clean content (#1)
- Improve model card: Update metadata, paper link, and clean content (c8c00d5be76065d9a4fffde2edb8bfd16c96812e)
Co-authored-by: Niels Rogge <[email protected]>
README.md
CHANGED
@@ -1,23 +1,25 @@
 ---
-
+base_model:
+- wusize/Harmon-1_5B
 datasets:
 - brivangl/midjourney-v6-llava
 language:
 - en
 - zh
-
-
-
+license: apache-2.0
+pipeline_tag: text-to-image
+library_name: diffusers
 ---
+
 # Harmon-1.5B-RecA
 
 > A self-supervised training framework that aligns understanding and generation in modest compute, with huge **zero-shot** gain on generation and editing capability.
 
-This repository hosts the model weights for **Harmon-1.5B-RecA
+This repository hosts the model weights for **Harmon-1.5B-RecA**, a model from the paper [Reconstruction Alignment Improves Unified Multimodal Models](https://huggingface.co/papers/2509.07295). For installation, usage instructions, and further documentation, please visit Harmon's original [GitHub repository](https://github.com/wusize/Harmon).
 
 ## 🧠 Method
 
-[](https://
+[](https://huggingface.co/papers/2509.07295)
 [](https://arxiv.org/abs/2509.07295)
 [](https://github.com/HorizonWind2004/reconstruction-alignment)
 [](https://huggingface.co/collections/sanaka87/realign-68ad2176380355a3dcedc068)
@@ -32,15 +34,11 @@ This repository hosts the model weights for **Harmon-1.5B-RecA**. For installati
 | **Harmon-1.5B** | 0.73 | 80.93 | 0.41 |
 | **Harmon-1.5B-RecA** | **0.86** | **87.21** | **0.50** |
 
-## License
-
-Harmon-1.5B-RecA is licensed under the Apache 2.0 license.
-
-
 ## ✍️ Citation
 
 If you find our work inspiring or use our codebase in your research, please consider giving a star ⭐ and a citation~
 
+```bibtex
 @misc{xie2025reconstructionalignmentimprovesunified,
 title={Reconstruction Alignment Improves Unified Multimodal Models},
 author={Ji Xie and Trevor Darrell and Luke Zettlemoyer and XuDong Wang},
@@ -49,4 +47,5 @@ If you find our work inspiring or use our codebase in your research, please cons
 archivePrefix={arXiv},
 primaryClass={cs.CV},
 url={https://arxiv.org/abs/2509.07295},
-}
+}
+```
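The metadata added in this change (`base_model`, `license`, `pipeline_tag`, `library_name`) lives in the YAML front matter at the top of README.md. As a rough way to sanity-check such a header locally, the sketch below parses the front matter with the standard library only. The helper `parse_front_matter` is hypothetical and is not part of the Harmon or RecA codebases; it handles only the small YAML subset this card uses (top-level `key: value` pairs and `key:` followed by `- item` lists) and is not a general YAML parser.

```python
# Hypothetical helper for illustration; not part of the Harmon or RecA codebases.
# Assumption: the front matter uses only flat "key: value" pairs and
# "key:" followed by "- item" lists, as in the card header of this change.

def parse_front_matter(text):
    """Return the front-matter fields of a model card as a dict."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter present
    meta = {}
    current_key = None  # key of the list currently being filled, if any
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the header
            break
        if not stripped:
            continue
        if stripped.startswith("- "):
            if current_key is not None:
                meta[current_key].append(stripped[2:].strip())
            continue
        key, _, value = stripped.partition(":")
        key, value = key.strip(), value.strip()
        if value:
            meta[key] = value  # scalar field
            current_key = None
        else:
            meta[key] = []  # list field; items follow on later lines
            current_key = key
    return meta


# Header copied from the new side of the diff above.
CARD_HEADER = """---
base_model:
- wusize/Harmon-1_5B
datasets:
- brivangl/midjourney-v6-llava
language:
- en
- zh
license: apache-2.0
pipeline_tag: text-to-image
library_name: diffusers
---
# Harmon-1.5B-RecA
"""

meta = parse_front_matter(CARD_HEADER)
for field in ("base_model", "license", "pipeline_tag", "library_name"):
    print(field, "->", meta.get(field))
```

For real cards, a full YAML parser is the safer choice; this sketch only illustrates which fields the commit introduces and how they are laid out.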