NaVILA: Legged Robot Vision-Language-Action Model for Naviga
a8cheng/navila-llama3-8b-8f • Updated Mar 11 • 509 • 4
a8cheng/navila-qwen2-7b-64k-64f • Updated Mar 11 • 1
a8cheng/navila-siglip-llama3-8b-v1.5-pretrain • Updated Jul 6 • 1.03k
SpatialRGPT: Grounded Spatial Reasoning in VLMs
a8cheng/SpatialRGPT-VILA1.5-8B • Updated Oct 6, 2024 • 768 • 5
a8cheng/OpenSpatialDataset • Updated Oct 3, 2024 • 64 • 9
a8cheng/SpatialRGPT-Bench • Viewer • Updated May 5 • 1.41k • 189 • 10