NaVILA: Legged Robot Vision-Language-Action Model for Naviga a8cheng/navila-llama3-8b-8f Updated Mar 11 • 562 • 4 a8cheng/navila-qwen2-7b-64k-64f Updated Mar 11 • 1 a8cheng/navila-siglip-llama3-8b-v1.5-pretrain Updated Jul 6 • 913
SpatialRGPT: Grounded Spatial Reasoning in VLMs a8cheng/SpatialRGPT-VILA1.5-8B Updated Oct 6, 2024 • 806 • 5 a8cheng/OpenSpatialDataset Updated Oct 3, 2024 • 59 • 9 a8cheng/SpatialRGPT-Bench Viewer • Updated May 5 • 1.41k • 203 • 10
NaVILA: Legged Robot Vision-Language-Action Model for Naviga a8cheng/navila-llama3-8b-8f Updated Mar 11 • 562 • 4 a8cheng/navila-qwen2-7b-64k-64f Updated Mar 11 • 1 a8cheng/navila-siglip-llama3-8b-v1.5-pretrain Updated Jul 6 • 913
SpatialRGPT: Grounded Spatial Reasoning in VLMs a8cheng/SpatialRGPT-VILA1.5-8B Updated Oct 6, 2024 • 806 • 5 a8cheng/OpenSpatialDataset Updated Oct 3, 2024 • 59 • 9 a8cheng/SpatialRGPT-Bench Viewer • Updated May 5 • 1.41k • 203 • 10