Wenyi Hong

wenyi
  • wenyihong

AI & ML interests

multi-modal, pretrain

Organizations

Z.ai & THUKEG

wenyi's activity

upvoted a paper 5 months ago

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

Paper • 2501.02955 • Published Jan 6 • 45
upvoted 2 papers 10 months ago

CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published Aug 29, 2024 • 58

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12, 2024 • 40
upvoted a paper over 1 year ago

CogAgent: A Visual Language Model for GUI Agents

Paper • 2312.08914 • Published Dec 14, 2023 • 31