Self-Training Enables Video Instruction Tuning with Any Supervision
Orr Zohar PRO
orrzohar
AI & ML interests
Large Multi-Modal Models, Foundation Models, Video Understanding
Recent Activity
liked a model (1 day ago): Qwen/Qwen3-VL-Embedding-8B
upvoted a paper (22 days ago): OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
upvoted a paper (22 days ago): Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model