RewardBench 2: Advancing Reward Model Evaluation (Paper 2506.01937)
siglip2 backbone (competitions: AIOrNot, Imagenette, and Driver-Drowsiness). Models and datasets are listed below: