
Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

commented on a paper 1 day ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

upvoted a paper 1 day ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

authored a paper 1 day ago

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models


Organizations

commented on a paper 1 day ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

upvoted a paper 1 day ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 3 days ago • 37

authored 4 papers 1 day ago

Don't Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models

Paper • 2505.21765 • Published May 27

Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation

Paper • 2506.10395 • Published 17 days ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 3 days ago • 37

commented on a paper 2 days ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 3 days ago • 37

upvoted a paper 2 days ago

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

commented on 2 papers 2 days ago

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

Paper • 2506.20911 • Published 3 days ago • 37

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Paper • 2506.21551 • Published 2 days ago • 25

upvoted a paper 10 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 11 days ago • 10

authored a paper 11 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 11 days ago • 10

commented on a paper 11 days ago

Optimizing Length Compression in Large Reasoning Models

Paper • 2506.14755 • Published 11 days ago • 10

authored a paper 11 days ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published 19 days ago • 46

upvoted a paper 11 days ago

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published 19 days ago • 46

authored a paper 18 days ago

What makes Reasoning Models Different? Follow the Reasoning Leader for Efficient Decoding

Paper • 2506.06998 • Published 21 days ago

upvoted a paper about 1 month ago

The Hallucination Tax of Reinforcement Finetuning

Paper • 2505.13988 • Published May 20 • 8

authored 3 papers about 1 month ago

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Paper • 2504.20406 • Published Apr 29 • 7

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

Paper • 2505.01481 • Published May 2 • 3

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 94

