Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

moonshotai
/
Kimi-Audio-7B-Instruct

Text-to-Speech
KimiAudio
Safetensors
English
Chinese
audio
audio-language-model
speech-recognition
audio-understanding
audio-generation
chat
custom_code
Model card Files Files and versions Community
18
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Fix incorrect unk_id assignment

#16 opened 18 days ago by
codecho

Request: DOI

#14 opened 30 days ago by
huseyinyolcu

supported languages?

👍 1
#12 opened about 1 month ago by
nononameneeded2001

About the weight files of the Whisper Encoder

1
#11 opened about 1 month ago by
codecho

how can I fine tune this for farsi?

#10 opened about 1 month ago by
uncleMehrzad

Cannot Run Model in Hugging Face Spaces: AutoProcessor/Processor Not Found

#9 opened about 1 month ago by
ranagame

Будет ли поддержка Русского языка?

#8 opened about 1 month ago by
fduches2

A video on how to set up this in a Colab notebook

1
#7 opened about 1 month ago by
ritheshSree

Vocoder Architecture?

#6 opened about 1 month ago by
yukiarimo

Base model?

1
#4 opened about 1 month ago by
deltanym

Issue with long audio (~1 min) output, or prompt instruct following

👀 1
2
#2 opened about 1 month ago by
JosephusCheung

Update correct task tag

1
#1 opened about 1 month ago by
reach-vb
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs