Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yuhao Zhang's picture
2 6 2

Yuhao Zhang

Yoohao
·
https://xiaozhang521.github.io/

AI & ML interests

Speech, NLP, Machine translation

Organizations

FreedomAI's profile picture

Yoohao's activity

upvoted a paper 2 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 78
upvoted 2 papers 3 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Paper • 2503.05085 • Published Mar 7 • 48
upvoted a paper 8 months ago

Roadmap towards Superhuman Speech Understanding using Large Language Models

Paper • 2410.13268 • Published Oct 17, 2024 • 35
upvoted a paper 9 months ago

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 55
upvoted a paper 11 months ago

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

Paper • 2407.13301 • Published Jul 18, 2024 • 57
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs