Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks • Paper • 2501.08326 • Published 23 days ago • 31
Marigold-LCM Depth Estimation 🏵 Generate 3D depth maps from images and videos • Running on Zero • 160