Open World Object Detection in the Era of Foundation Models Paper • 2312.05745 • Published Dec 10, 2023 • 1
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Paper • 2407.06189 • Published Jul 8 • 24
μ-Bench: A Vision-Language Benchmark for Microscopy Understanding Paper • 2407.01791 • Published Jul 1 • 5
MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records Paper • 2308.14089 • Published Aug 27, 2023 • 28