zhu
xuekai
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
13 days ago
ASPO: Asymmetric Importance Sampling Policy Optimization
upvoted
a
paper
15 days ago
Agentic Context Engineering: Evolving Contexts for Self-Improving
Language Models
upvoted
a
paper
22 days ago
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by
Composing Old Ones