The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Sitong Gong 1  Yunzhi Zhuge 1  Lu Zhang 1  Zongxin Yang 2  Pingping Zhang 1  Huchuan Lu 1 

CVPR 2025

1 Dalian University of Technology   2 Havard University 

arXiv

You can find the code at: https://github.com/SitongGong/VRS-HQ

Downloads last month
4
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for SitongGong/VRS-HQ

Finetuned
(1)
this model