Speaker diarization
Relies on pyannote.audio 2.0, which is currently in development; see the installation instructions.
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1")
output = pipeline("audio.wav")
for turn, _, speaker in output.itertracks(yield_label=True):
    # speaker speaks between turn.start and turn.end
    ...
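As a rough sketch of what the loop body might do, the snippet below totals speaking time per speaker. It works on hand-written `(start, end, speaker)` tuples as a stand-in for the values read from `turn.start`, `turn.end` and `speaker`. The helper name `speaking_time` and the sample values are illustrative assumptions, not part of pyannote.audio.

```python
from collections import defaultdict


def speaking_time(turns):
    """Sum the duration, in seconds, of each speaker's turns.

    `turns` is an iterable of (start, end, speaker) tuples.
    """
    totals = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical turns (seconds), standing in for real pipeline output.
example_turns = [
    (0.0, 2.5, "SPEAKER_00"),
    (2.5, 4.0, "SPEAKER_01"),
    (4.0, 7.0, "SPEAKER_00"),
]

totals = speaking_time(example_turns)
for speaker, seconds in sorted(totals.items()):
    print(f"{speaker}: {seconds:.1f}s")
```

With real pipeline output, the list of tuples could be built inside the loop above from `turn.start`, `turn.end` and `speaker`.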
Benchmark
Dataset | Diarization error rate |
---|---|
AMI only_words evaluation set | 21.4% |
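
For context, diarization error rate is conventionally defined as the sum of false alarm, missed detection and speaker confusion durations, divided by the total duration of reference speech. The sketch below applies that formula to hypothetical durations; they are not the AMI measurements behind the figure in the table, and the function name is an assumption made for illustration.

```python
def diarization_error_rate(false_alarm, missed_detection, confusion, total):
    """Return DER as a fraction of the total reference speech duration.

    All arguments are durations in seconds.
    """
    if total <= 0:
        raise ValueError("total reference speech duration must be positive")
    return (false_alarm + missed_detection + confusion) / total


# Hypothetical durations in seconds, chosen only to illustrate the formula.
der = diarization_error_rate(
    false_alarm=3.0,
    missed_detection=5.0,
    confusion=2.0,
    total=100.0,
)
print(f"{der:.1%}")  # prints "10.0%"
```

Because the numerator can exceed the reference duration (for example, with many false alarms), DER is not bounded above by 100%.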