Probing the 3D Awareness of Visual Foundation Models Paper • 2404.08636 • Published Apr 12, 2024 • 14
VideoChat-R1 Collection VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning • 3 items • Updated Apr 22 • 5
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 139
Cosmos-Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 3 days ago • 40
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation Paper • 2407.17952 • Published Jul 25, 2024 • 33