5 14 19

Ganqu Cui

ganqu

cgq15

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

authored a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

upvoted a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

View all activity

Organizations

ganqu's activity

upvoted a paper 1 day ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published 2 days ago • 41

authored a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 9 days ago • 116

upvoted a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 9 days ago • 116

commented a paper 9 days ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 9 days ago • 116 •

upvoted a paper 15 days ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published 17 days ago • 60

authored a paper about 1 month ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 112

upvoted a paper about 1 month ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 112

authored a paper about 2 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 85

upvoted a paper about 2 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 85

authored a paper 2 months ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 39

upvoted a paper 2 months ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 39

upvoted 2 papers 3 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 27

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Paper • 2503.01496 • Published Mar 3 • 18

liked 4 datasets 4 months ago

upvoted a paper 4 months ago

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Paper • 2502.07563 • Published Feb 11 • 24

authored a paper 4 months ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 24

upvoted a paper 4 months ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 24