Cheng Qian's picture

2 23

Cheng Qian

chengq9

·

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 8 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

upvoted a paper 2 months ago

Multimodal Policy Internalization for Conversational Agents

upvoted a paper 2 months ago

Self-Improving LLM Agents at Test-Time

View all activity

Organizations

Collections 1

Papers 17

arxiv:2509.19736

arxiv:2509.09614

arxiv:2507.22034

arxiv:2507.21046

models 3

chengq9/ToolRL-Qwen2.5-1.5B

2B • Updated Apr 22 • 111

chengq9/ToolRL-Qwen2.5-3B

3B • Updated Apr 22 • 7.29k • 1

chengq9/ToolRL-Llama3.2-3B

4B • Updated Apr 22 • 26

datasets 0

None public yet