Zou Lexiao
Lokshaw
AI & ML interests
None yet
Recent Activity
upvoted a paper 16 days ago
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
upvoted a paper about 1 month ago
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
upvoted a paper about 1 month ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens