PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning Paper • 2308.03977 • Published Aug 8, 2023
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 30
Improving the Scaling Laws of Synthetic Data with Deliberate Practice Paper • 2502.15588 • Published Feb 21
Running 14 14 Leaderboard: Physical Reasoning from Video 🏃 Submit and score model predictions for video and text tasks
Running 14 14 Leaderboard: Physical Reasoning from Video 🏃 Submit and score model predictions for video and text tasks
Running 14 14 Leaderboard: Physical Reasoning from Video 🏃 Submit and score model predictions for video and text tasks