
zpysky1125

pyzhao

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago

Article

Why We Built VIBE Bench: Rethinking Evaluation for Real Workloads

Jan 6


liked a model 22 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated about 24 hours ago • 390k • 1.11k

upvoted an article 22 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

22 days ago • 134

liked a dataset about 2 months ago

MiniMaxAI/OctoCodingBench

Viewer • Updated Jan 13 • 72 • 804 • 263

upvoted an article 2 months ago

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5


liked a model 2 months ago

MiniMaxAI/MiniMax-M2.1

Text Generation • 229B • Updated 22 days ago • 67.7k • 1.26k

liked a dataset 2 months ago

MiniMaxAI/VIBE

Viewer • Updated Dec 23, 2025 • 200 • 583 • 272

upvoted a collection 3 months ago

VTP

Collection

Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 22 days ago • 42

upvoted a paper 3 months ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 106

upvoted 3 articles 4 months ago

Article

What makes good reasoning data

Oct 30, 2025


Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025


Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30, 2025


New activity in MiniMaxAI/MiniMax-M2 4 months ago

Was the training done with FP8 or BF16?

#14 opened 4 months ago by mindkrypted

About the LCB evaluation


#13 opened 4 months ago by sayhitoday

liked a model 4 months ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 233k • 1.49k

upvoted a paper 6 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 82

upvoted a paper 9 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

liked 2 models 9 months ago

MiniMaxAI/MiniMax-M1-80k

Text Generation • Updated Jul 7, 2025 • 63.3k • 690

MiniMaxAI/MiniMax-M1-40k

Text Generation • Updated Jul 7, 2025 • 11.6k • 184

upvoted a paper 9 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 144
