-
Unified Personalized Reward Model for Vision Generation
Paper • 2602.02380 • Published • 19 -
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora
Text-to-Image • Updated • 58 • 5 -
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 65 • 5 -
CodeGoat24/FLUX.1-dev-UnifiedReward-Flex
Text-to-Image • Updated • 27
SII-Yibin Wang
CodeGoat24
AI & ML interests
I'm part of Shanghai Innovation Institute, focusing on Multimodal RL and Generation.
Recent Activity
updated
a model
about 21 hours ago
CodeGoat24/UnifiedReward-Flex-qwen3vl-2b
updated
a model
about 21 hours ago
CodeGoat24/UnifiedReward-Flex-qwen3vl-32b
updated
a model
about 21 hours ago
CodeGoat24/UnifiedReward-Flex-qwen3vl-4b
Organizations
Pref-GRPO & UniGenBench
-
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation
Paper • 2510.18701 • Published • 67 -
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Paper • 2508.20751 • Published • 89 -
CodeGoat24/UniGenBench-Eval-Images
Preview • Updated • 1.07k • 4 -
CodeGoat24/UniGenBench-EvalModel-qwen3vl-32b-v1
Image-to-Text • 1.14M • Updated • 16
UnifiedReward Flex
-
Unified Personalized Reward Model for Vision Generation
Paper • 2602.02380 • Published • 19 -
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora
Text-to-Image • Updated • 58 • 5 -
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 65 • 5 -
CodeGoat24/FLUX.1-dev-UnifiedReward-Flex
Text-to-Image • Updated • 27
Pref-GRPO & UniGenBench
-
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation
Paper • 2510.18701 • Published • 67 -
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Paper • 2508.20751 • Published • 89 -
CodeGoat24/UniGenBench-Eval-Images
Preview • Updated • 1.07k • 4 -
CodeGoat24/UniGenBench-EvalModel-qwen3vl-32b-v1
Image-to-Text • 1.14M • Updated • 16
spaces
4
pinned
Running
3
UniGenBench Leaderboard (Chinese Long)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
3
UniGenBench Leaderboard (Chinese)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
7
UniGenBench Leaderboard (English)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
3
UniGenBench Leaderboard (English Long)
🏅
UniGenBench: a unified T2I generation benchmark.
models
43
CodeGoat24/UnifiedReward-Flex-qwen3vl-32b
1.14M
•
Updated
•
15
CodeGoat24/UnifiedReward-Flex-qwen3vl-2b
2B
•
Updated
•
38
CodeGoat24/UnifiedReward-Flex-qwen3vl-4b
4B
•
Updated
•
35
CodeGoat24/UnifiedReward-Flex-qwen3vl-8b
9B
•
Updated
•
171
CodeGoat24/FLUX.1-dev-UnifiedReward-Flex
Text-to-Image
•
Updated
•
27
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora
Text-to-Video
•
Updated
•
65
•
5
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora
Text-to-Image
•
Updated
•
58
•
5
CodeGoat24/UnifiedReward-Think-qwen3vl-32b
1.14M
•
Updated
•
385
CodeGoat24/UniGenBench-EvalModel-qwen3vl-32b-v1
Image-to-Text
•
1.14M
•
Updated
•
16
CodeGoat24/UnifiedReward-Think-qwen3vl-4b
4B
•
Updated
•
26
datasets
14
CodeGoat24/UnifiedReward-Flex-SFT-90K
Viewer
•
Updated
•
1.39M
•
75
•
2
CodeGoat24/UniGenBench-Eval-Images
Preview
•
Updated
•
1.07k
•
4
CodeGoat24/UniGenBench
Updated
•
13
•
3
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer
•
Updated
•
337k
•
353
CodeGoat24/VIDEOGEN
Viewer
•
Updated
•
50.9k
•
9
CodeGoat24/ShareGPTVideo-DPO
Viewer
•
Updated
•
101k
•
47
CodeGoat24/VideoFeedback
Viewer
•
Updated
•
73.2k
•
54
CodeGoat24/VideoDPO
Viewer
•
Updated
•
29k
•
206
CodeGoat24/OIP
Viewer
•
Updated
•
21.4k
•
61
CodeGoat24/LLaVA-Critic-113k
Preview
•
Updated
•
168