From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models. Paper • arXiv:2506.09930 • Published 16 days ago
SAFE: Multitask Failure Detection for Vision-Language-Action Models. Paper • arXiv:2506.09937 • Published 16 days ago
Hidden in plain sight: VLMs overlook their visual representations. Paper • arXiv:2506.08008 • Published 18 days ago
RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation. Paper • arXiv:2506.18088 • Published 5 days ago