Vision-Language-Action Models in Minecraft.
-
CraftJarvis/JarvisVLA-Qwen2-VL-7B
Image-Text-to-Text • Updated • 11 • 7 -
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
Paper • 2503.16365 • Published • 33 -
8
Minecraft VLM Leaderboard
🏢Display and filter LLM leaderboard for Minecraft models
-
CraftJarvis/minecraft-vla-sft
Viewer • Updated • 3.78M • 160 • 3