CodeGoat24 committed
Commit 373d46a · verified · 1 parent: d668d82

Update README.md

Files changed (1): README.md (+4 −4)
README.md CHANGED
@@ -19,7 +19,7 @@ base_model:
 `Unified-Reward-Think-7b` is the first unified multimodal CoT reward model, capable of multi-dimensional, step-by-step long-chain reasoning for both visual understanding and generation reward tasks.
 
 For further details, please refer to the following resources:
-<!-- - 📰 Paper: https://arxiv.org/pdf/2503.05236 -->
+- 📰 Paper: https://arxiv.org/pdf/2505.03318
 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/think
 - 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
 - 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
@@ -112,10 +112,10 @@ print(text_outputs[0])
 ## Citation
 
 ```
-@article{UnifiedReward,
+@article{UnifiedReward-Think,
 title={Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning.},
 author={Wang, Yibin and Li, Zhimin and Zang, Yuhang and Wang, Chunyu and Lu, Qinglin and Jin, Cheng and Wang, Jiaqi},
-journal={arXiv preprint arXiv:},
+journal={arXiv preprint arXiv:2505.03318},
 year={2025}
 }
 ```
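The substance of this commit is filling in the previously empty `arXiv:` field with the identifier 2505.03318. As a quick sanity check on such an edit, here is a minimal sketch (the regex and helper name are illustrative, not part of this repository) that verifies a string matches the new-style `YYMM.NNNNN` arXiv identifier format:

```python
import re

# New-style arXiv identifiers look like "2505.03318":
# four digits (year/month) then a 4- or 5-digit sequence number.
ARXIV_ID_RE = re.compile(r"^\d{4}\.\d{4,5}$")

def looks_like_arxiv_id(arxiv_id: str) -> bool:
    """Return True if the string matches the new-style arXiv ID shape."""
    return ARXIV_ID_RE.fullmatch(arxiv_id) is not None

# The identifier introduced by this commit passes:
print(looks_like_arxiv_id("2505.03318"))  # True
# The previous, empty field ("arXiv:") would not:
print(looks_like_arxiv_id(""))  # False
```

This only checks the shape of the identifier, not whether the paper exists; it catches the empty-field case the old citation had.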