zhidong-gao's Collections
Audio

updated Aug 15, 2024

  • NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

    Paper • 2403.03100 • Published Mar 5, 2024 • 38

  • MooER: LLM-based Speech Recognition and Translation Models from Moore Threads

    Paper • 2408.05101 • Published Aug 9, 2024 • 8