
ZhenYE

ZhenYe234
https://github.com/zhenye234
  • zhenye234

AI & ML interests

None yet

Recent Activity

liked a dataset 23 days ago
OpenSound/CapSpeech
liked a model about 1 month ago
IndexTeam/Index-anisora
new activity about 1 month ago
HKUSTAudio/xcodec2:How to train?

Organizations

HKUST Audio

authored a paper 5 months ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6 • 26
authored 4 papers 7 months ago

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Paper • 2305.06908 • Published May 11, 2023 • 6

CoMoSVC: Consistency Model-based Singing Voice Conversion

Paper • 2401.01792 • Published Jan 3, 2024 • 11

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 33

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Paper • 2408.17175 • Published Aug 30, 2024 • 4