LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries Paper • 2508.15760 • Published 1 day ago • 25 • 2
Preference Learning Unlocks LLMs' Psycho-Counseling Skills Paper • 2502.19731 • Published Feb 27 • 7 • 2
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy Paper • 2410.13218 • Published Oct 17, 2024 • 4 • 2