Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a model
1 day ago
yangzhch6/Mirror-Verifier-1.5B
updated
a model
1 day ago
yangzhch6/Mirror-Verifier-7B
updated
a model
1 day ago
yangzhch6/Zero-Solver-Qwen2.5-Math-7B-L
Organizations
None yet