README.md · techiaith/whisper-large-v3-ft-verbatim-cy-en-ct2 at main

metadata

license: apache-2.0
language:
  - cy
  - en
base_model:
  - techiaith/whisper-large-v3-ft-verbatim-cy-en
pipeline_tag: automatic-speech-recognition
tags:
  - faster-whisper

Model Name: whisper-large-v3-ft-verbatim-cy-en-ct2

Model Description:

This model card describes whisper-large-v3-ft-verbatim-cy-en-ct2, a conversion of the techiaith/whisper-large-v3-ft-verbatim-cy-en fine-tuned OpenAI whisper model to the CTranslate2 format. This conversion allows for significantly faster and more efficient inference, particularly on CPU and with batching.

How to Use:

from faster_whisper import WhisperModel

audio_file_path=<path to your audio file>

model = WhisperModel("techiaith/whisper-large-v3-ft-verbatim-cy-en-ct2")
segments, info = model.transcribe(audio_file_path, beam_size=5)

print("Detected language '%s' with probability %f" % (info.language, info.language_probability))
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))

Detected language 'cy' with probability 0.999987
[0.00s -> 4.24s] Dwi teimlo weithie unwaith ti'n cyfadda bo' na rwbath yn bod ma'n wir wedyn dydi?