- π_0: A Vision-Language-Action Flow Model for General Robot Control
  Paper • 2410.24164 • Published • 11
- Magma: A Foundation Model for Multimodal AI Agents
  Paper • 2502.13130 • Published • 58
- Open X-Embodiment: Robotic Learning Datasets and RT-X Models
  Paper • 2310.08864 • Published • 2
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
  Paper • 2502.13143 • Published • 30
zz
chocolala
AI & ML interests
None yet
Recent Activity
upvoted a paper 14 days ago
Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
upvoted a paper 14 days ago
TransMamba: Flexibly Switching between Transformer and Mamba
Organizations
Collections: 1
Models: 0 (none public yet)
Datasets: 0 (none public yet)