OWLS: Scaling Laws for Speech Recognition and Translation

updated May 3

🦉 A suite of Whisper-style models ranging from 250M to 18B parameters, trained on up to 360K hours of data. All models use a 16 kHz sampling rate.

  • espnet/owls_4B_180K

    Automatic Speech Recognition • Updated May 3 • 19 • 5

  • espnet/owls_9B_180K

    Automatic Speech Recognition • Updated May 3 • 17

  • espnet/owls_05B_180K

    Automatic Speech Recognition • Updated May 3 • 9

  • espnet/owls_025B_180K

    Automatic Speech Recognition • Updated May 3 • 9

  • espnet/owls_1B_180K

    Automatic Speech Recognition • Updated May 3 • 11 • 3

  • espnet/owls_2B_180K

    Automatic Speech Recognition • Updated May 3 • 11

  • espnet/owls_18B_180K

    Automatic Speech Recognition • Updated May 3 • 7 • 1

  • espnet/owls_18B_360K

    Automatic Speech Recognition • Updated May 3 • 13 • 1
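
The repository names in this list appear to encode model size and training-data hours (for example, `owls_025B_180K` for the 250M-parameter model trained on 180K hours). The sketch below shows one way to read those names and sort the collection by size. The helper `parse_owls_repo_id` is hypothetical, and the reading of a leading zero as a decimal fraction (`025B` as 0.25B, `05B` as 0.5B) is an assumption inferred from the stated 250M to 18B range, not something the collection page documents.

```python
import re

# Repo ids copied from the collection listing above.
REPO_IDS = [
    "espnet/owls_4B_180K",
    "espnet/owls_9B_180K",
    "espnet/owls_05B_180K",
    "espnet/owls_025B_180K",
    "espnet/owls_1B_180K",
    "espnet/owls_2B_180K",
    "espnet/owls_18B_180K",
    "espnet/owls_18B_360K",
]

_PATTERN = re.compile(r"espnet/owls_(\d+)B_(\d+)K")


def parse_owls_repo_id(repo_id: str) -> tuple[float, int]:
    """Return (parameters in billions, training hours) for an OWLS repo id.

    Hypothetical helper: the naming scheme is inferred, not documented.
    """
    match = _PATTERN.fullmatch(repo_id)
    if match is None:
        raise ValueError(f"not an OWLS repo id: {repo_id!r}")
    size, hours_k = match.groups()
    # Assumption: a leading zero marks a fractional size ("025" -> 0.25).
    if len(size) > 1 and size.startswith("0"):
        params_b = float("0." + size[1:])
    else:
        params_b = float(size)
    return params_b, int(hours_k) * 1000


# Sort smallest model first; ties on size are broken by training hours.
by_size = sorted(REPO_IDS, key=parse_owls_repo_id)

if __name__ == "__main__":
    for repo_id in by_size:
        params_b, hours = parse_owls_repo_id(repo_id)
        print(f"{repo_id}: {params_b}B parameters, {hours} hours")
```

Sorting on the parsed tuple rather than on the raw string avoids the lexical ordering problem where `18B` would sort before `2B`.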