Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

tsinghua-ee
/
SALMONN-7B

Automatic Speech Recognition
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
Model card Files Files and versions Community
SALMONN-7B / resource /audio_demo
Ctrl+K
Ctrl+K
  • 2 contributors
History: 1 commit
tangchangli
chore: init repo
7cf7820 over 1 year ago
  • duck.wav
    640 kB
    chore: init repo over 1 year ago
  • excitement.wav
    40.4 kB
    chore: init repo over 1 year ago
  • gunshots.wav
    320 kB
    chore: init repo over 1 year ago
  • mountain.wav
    79.1 kB
    chore: init repo over 1 year ago
  • music.wav
    639 kB
    chore: init repo over 1 year ago