-
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
Paper • 2505.16410 • Published • 56 -
dongguanting/Tool-Star-Qwen-3B
Text Generation • 3B • Updated • 2.06k • 5 -
mradermacher/Tool-Star-Qwen-3B-GGUF
3B • Updated • 179 • 3 -
dongguanting/Tool-Star-SFT-54K
Viewer • Updated • 54k • 775 • 7
KABI
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
collection
about 11 hours ago
Qwen3
upvoted
a
paper
11 days ago
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
upvoted
a
paper
11 days ago
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention