Hammer: Robust Function-Calling for On-Device Language Models via Function Masking Paper • 2410.04587 • Published Oct 6, 2024 • 2
Direct Multi-Turn Preference Optimization for Language Agents Paper • 2406.14868 • Published Jun 21, 2024
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper • 2508.20453 • Published 6 days ago • 52