LlamaSeg: Image Segmentation via Autoregressive Mask Generation Paper • 2505.19422 • Published May 26 • 3
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models Paper • 2503.14939 • Published Mar 19 • 5