Depth Any Video with Scalable Synthetic Data
State-of-the-art target speech extractor
Generate football pitch minimaps from images
Small Space to test ViTPose
Detect and annotate poses in images and videos