- Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
  Paper • 2508.20751 • Published • 89
- CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1
  Image-to-Text • 73B • Updated • 6
- CodeGoat24/UniGenBench-Eval-Images
  Viewer • Updated • 762k • 3.06k • 2
- CodeGoat24/UniGenBench
  Updated • 118 • 1
SII-Yibin Wang
CodeGoat24
AI & ML interests
I'm part of the Shanghai Innovation Institute, focusing on multimodal RL and generation.
Recent Activity
- Updated a model 3 days ago: CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1
- Updated a dataset 3 days ago: CodeGoat24/UniGenBench-Eval-Images
- Updated a collection 3 days ago: Pref-GRPO & UniGenBench
Organizations
Pref-GRPO & UniGenBench
UnifiedReward 2.0 Models
Spaces (4)
- UniGenBench Leaderboard (Chinese Long) 🏅
  pinned • Running • 2
  UniGenBench: a unified T2I generation benchmark.
- UniGenBench Leaderboard (Chinese) 🏅
  pinned • Running • 1
  UniGenBench: a unified T2I generation benchmark.
- UniGenBench Leaderboard (English Long) 🏅
  pinned • Running • 2
  UniGenBench: a unified T2I generation benchmark.
- UniGenBench Leaderboard (English) 🏅
  pinned • Running • 3
  UniGenBench: a unified T2I generation benchmark.
Models (19)
- CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1
  Image-to-Text • 73B • Updated • 6
- CodeGoat24/UnifiedReward-2.0-qwen-72b
  Image-to-Text • 73B • Updated • 141
- CodeGoat24/UnifiedReward-2.0-qwen-32b
  33B • Updated • 137
- CodeGoat24/UnifiedReward-2.0-qwen-3b
  4B • Updated • 78 • 1
- CodeGoat24/UnifiedReward-2.0-qwen-7b
  8B • Updated • 758
- CodeGoat24/FLUX.1-dev-PrefGRPO
  Text-to-Image • Updated • 14 • 3
- CodeGoat24/UnifiedReward-Think-7b
  8B • Updated • 9 • 10
- CodeGoat24/UnifiedReward-Think-qwen-7b
  8B • Updated • 1.44k • 3
- CodeGoat24/T2V-Turbo
  Updated
- CodeGoat24/LLaVA-Video-7B-Qwen2-UnifiedReward-DPO
  8B • Updated • 4
Datasets (15)
- CodeGoat24/UniGenBench-Eval-Images
  Viewer • Updated • 762k • 3.06k • 2
- CodeGoat24/UniGenBench
  Updated • 118 • 1
- CodeGoat24/UnifiedReward-2.0-T2X-score-data
  Viewer • Updated • 337k • 193
- CodeGoat24/VIDEOGEN
  Viewer • Updated • 50.9k • 46
- CodeGoat24/GENAI-BENCH
  Viewer • Updated • 27.8k • 30
- CodeGoat24/ShareGPTVideo-DPO
  Viewer • Updated • 101k • 87
- CodeGoat24/VideoFeedback
  Viewer • Updated • 73.2k • 96
- CodeGoat24/VideoDPO
  Viewer • Updated • 29k • 97
- CodeGoat24/OIP
  Viewer • Updated • 21.4k • 92
- CodeGoat24/LLaVA-Critic-113k
  Preview • Updated • 62