Reducing Hallucinations in Vision-Language Models via Latent Space Steering Paper • 2410.15778 • Published Oct 21, 2024 • 1
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published Feb 16, 2025 • 18
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine Paper • 2408.02900 • Published Aug 6, 2024 • 31