Fan Zhou

koalazf99

https://koalazf99.github.io/

AI & ML interests

Deep Learning; Natural Language Processing; Foundation Models

Recent Activity

authored a paper about 13 hours ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

new activity about 19 hours ago

OctoThinker/MegaMath-Web-Pro-Max:[bot] Conversion to Parquet

liked a dataset 1 day ago

OctoThinker/MegaMath-Web-Pro-Max

authored a paper about 13 hours ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published 2 days ago • 30

authored a paper 7 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 10 days ago • 42

authored a paper 3 months ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 31

authored a paper 4 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 18

authored a paper 6 months ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

authored a paper 9 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 63

authored a paper about 1 year ago

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Paper • 2406.12753 • Published Jun 18, 2024 • 14

authored 2 papers over 1 year ago

Dissecting Human and LLM Preferences

Paper • 2402.11296 • Published Feb 17, 2024 • 3

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data

Paper • 2205.12682 • Published May 25, 2022