Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

commented on a paper 1 day ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

upvoted a paper 1 day ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

authored a paper 1 day ago

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

Organizations

commented on a paper 1 day ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

commented on 3 papers 2 days ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 3 days ago • 37

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

commented on a paper 11 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 11 days ago • 10

commented on 8 papers 2 months ago

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Paper • 2504.15785 • Published Apr 22 • 19

Exploring Expert Failures Improves LLM Agent Tuning

Paper • 2504.13145 • Published Apr 17 • 12

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published Apr 10 • 47

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Paper • 2504.10766 • Published Apr 14 • 40

commented on 4 papers 3 months ago

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published Apr 10 • 61

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Paper • 2504.06514 • Published Apr 9 • 39

commented on 3 papers 4 months ago
