GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning Paper • 2504.00891 • Published 7 days ago • 12
GenPRM Collection A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated 2 days ago • 4
CodeI/O Collection Collection for CodeI/O. Project page: https://codei-o.github.io/ • 15 items • Updated Feb 13 • 6
VersaPRM Collection Collection of VersaPRMs using various training configurations • 8 items • Updated Feb 8 • 1
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 149