Qihan Ren

jasonrqh

https://nebularaid2000.github.io/

AI & ML interests

explainable AI, LLM

Recent Activity

upvoted a paper 2 days ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

upvoted a paper 5 days ago

AutoPR: Let's Automate Your Academic Promotion!

liked a Space 5 days ago

yzweak/AutoPR

authored 2 papers 12 days ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published 20 days ago • 5

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published 18 days ago • 17

authored 3 papers 3 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law

Paper • 2507.18576 • Published Jul 24 • 6

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Paper • 2505.19509 • Published May 26 • 2

authored a paper 5 months ago

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26 • 7