Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published Apr 21, 2025 • 42 • 2
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22, 2025 • 60 • 4
Rethinking Patch Dependence for Masked Autoencoders Paper • 2401.14391 • Published Jan 25, 2024 • 27 • 2
Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping Paper • 2304.08025 • Published Apr 17, 2023 • 2 • 1