Ruihan Yang

rhyang2021

https://github.com/rhyang2021

AI & ML interests

NLP, Agent Learning, Uncertainty

Recent Activity

liked a model 3 days ago

MASWorks/MAS-GPT-32B

Organizations

None yet

upvoted a paper 9 days ago

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12 • 59

upvoted a paper 10 days ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16 • 10

upvoted a paper 21 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 215

upvoted a paper 25 days ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 27 days ago • 252

upvoted a collection 27 days ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 10 days ago • 106

upvoted 13 papers about 1 month ago

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Paper • 2505.02156 • Published May 4 • 18

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence

Paper • 2505.24500 • Published May 30 • 12

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Paper • 2406.04784 • Published Jun 7, 2024 • 2

Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics

Paper • 2506.00070 • Published May 29 • 28

Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning

Paper • 2506.03136 • Published Jun 3 • 23

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 48

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26 • 44

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Paper • 2506.00539 • Published May 31 • 30

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published May 29 • 23

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published May 26 • 43

upvoted 2 papers about 2 months ago

From Persona to Personalization: A Survey on Role-Playing Language Agents

Paper • 2404.18231 • Published Apr 28, 2024 • 1

The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement

Paper • 2503.16024 • Published Mar 20 • 1
