Agent-RL - a jzwong Collection

jzwong 's Collections

MLLM

LLM

LLM-RL

Novel

SYS

Survey

Agent-RL

updated 6 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 60
ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 44
OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published Apr 21 • 33
WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published 23 days ago • 52
ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 15 days ago • 63
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published 10 days ago • 38