ZHOU's picture

5 1

ZHOU

TOBI-X

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

upvoted a paper 2 days ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

upvoted a collection 7 days ago

🧠 Reasoning datasets

View all activity

Organizations

None yet

TOBI-X's activity

upvoted a paper 1 day ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 2 days ago • 69

upvoted a paper 2 days ago

reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs

Paper • 2503.11751 • Published 6 days ago • 15

upvoted a collection 7 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 16 items • Updated about 22 hours ago • 108

liked a dataset 11 days ago

amphora/MCLM

Viewer • Updated 16 days ago • 156 • 553 • 1

upvoted a paper about 1 month ago

BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models

Paper • 2502.07346 • Published Feb 11 • 51

upvoted a collection 4 months ago

MoEs papers reading list

60 items • Updated Nov 4, 2024 • 141