Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties Paper • 2505.20875 • Published May 27 • 4
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays Paper • 2505.18087 • Published May 23 • 7
Lunguage: A Benchmark for Structured and Sequential Chest X-ray Interpretation Paper • 2505.21190 • Published May 27 • 4
PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions Paper • 2505.17818 • Published May 23 • 11