zhang's picture

1 5 14

zhang

landy123007

·

AI & ML interests

None yet

Recent Activity

commented on a paper 7 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

upvoted an article 8 days ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

liked a Space 19 days ago

google/rad_explain

View all activity

Organizations

None yet

landy123007's activity

upvoted an article 8 days ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By

and 8 others •

9 days ago

• 120

upvoted an article 10 months ago

Article

Scaling robotics datasets with video encoding

By

and 2 others •

Aug 27, 2024

• 40

upvoted 2 papers 10 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 92

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 62

upvoted an article 11 months ago

Article

MobileNet Baselines

By

•

Jul 26, 2024

• 24