Commit 4d0f804 (verified) by sanaka87 and nielsr (HF Staff) · 1 parent: 957b72e

Improve model card: Update metadata, paper link, and clean content (#1)


- Improve model card: Update metadata, paper link, and clean content (c8c00d5be76065d9a4fffde2edb8bfd16c96812e)


Co-authored-by: Niels Rogge <[email protected]>

Files changed (1)
  1. README.md +11 -12
README.md CHANGED
@@ -1,23 +1,25 @@
 ---
-license: apache-2.0
+base_model:
+- wusize/Harmon-1_5B
 datasets:
 - brivangl/midjourney-v6-llava
 language:
 - en
 - zh
-base_model:
-- wusize/Harmon-1_5B
-pipeline_tag: any-to-any
+license: apache-2.0
+pipeline_tag: text-to-image
+library_name: diffusers
 ---
+
 # Harmon-1.5B-RecA
 
 > A self-supervised training framework that aligns understanding and generation in modest compute, with huge **zero-shot** gain on generation and editing capability.
 
-This repository hosts the model weights for **Harmon-1.5B-RecA**. For installation, usage instructions, and further documentation, please visit Harmon's original [GitHub repository](https://github.com/wusize/Harmon).
+This repository hosts the model weights for **Harmon-1.5B-RecA**, a model from the paper [Reconstruction Alignment Improves Unified Multimodal Models](https://huggingface.co/papers/2509.07295). For installation, usage instructions, and further documentation, please visit Harmon's original [GitHub repository](https://github.com/wusize/Harmon).
 
 ## 🧠 Method
 
-[![Paper](https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/pdf/2509.07295)
+[![Paper](https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://huggingface.co/papers/2509.07295)
 [![ArXiv](https://img.shields.io/badge/arXiv-A42C25?style=for-the-badge&logo=arxiv&logoColor=white&color=blue)](https://arxiv.org/abs/2509.07295)
 [![Github](https://img.shields.io/badge/RecA-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/HorizonWind2004/reconstruction-alignment)
 [![Hugging Face Collection](https://img.shields.io/badge/HF_Models-fcd022?style=for-the-badge&logo=huggingface&logoColor=000)](https://huggingface.co/collections/sanaka87/realign-68ad2176380355a3dcedc068)
@@ -32,15 +34,11 @@ This repository hosts the model weights for **Harmon-1.5B-RecA**. For installati
 | **Harmon-1.5B** | 0.73 | 80.93 | 0.41 |
 | **Harmon-1.5B-RecA** | **0.86** | **87.21** | **0.50** |
 
-## License
-
-Harmon-1.5B-RecA is licensed under the Apache 2.0 license.
-
-
 ## ✍️ Citation
 
 If you find our work inspiring or use our codebase in your research, please consider giving a star ⭐ and a citation~
 
+```bibtex
 @misc{xie2025reconstructionalignmentimprovesunified,
 title={Reconstruction Alignment Improves Unified Multimodal Models},
 author={Ji Xie and Trevor Darrell and Luke Zettlemoyer and XuDong Wang},
@@ -49,4 +47,5 @@ If you find our work inspiring or use our codebase in your research, please cons
 archivePrefix={arXiv},
 primaryClass={cs.CV},
 url={https://arxiv.org/abs/2509.07295},
-}
+}
+```