Yansong Shi

nanamma
https://huggingface.co/nanamma

AI & ML interests

multi modality, video understanding, robotics

Recent Activity

upvoted a paper about 1 month ago
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
authored a paper 3 months ago
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
authored a paper 3 months ago
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Organizations

OpenGVLab

authored 2 papers 3 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 27

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1