Precision-2 speaker diarization

This pipeline runs Precision-2 speaker diarization on pyannoteAI cloud.
Read the announcement blog post.

This pipeline is a stripped-down version of the pyannoteAI SDK, which provides many more features:

  • speaker diarization optimized for speech-to-text
  • speaker voiceprinting and identification
  • confidence scores
  • and more...

A self-hosted version of Precision-2 is also available for enterprise customers.

Setup

  1. pip install pyannote.audio
  2. Create an API key on the pyannoteAI dashboard (free credits included)
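
Rather than pasting the API key into source code, it can be read from the environment. The following is a minimal sketch; the variable name `PYANNOTEAI_API_KEY` and the helper `load_api_key` are hypothetical names chosen for illustration, not part of pyannote.audio or the pyannoteAI SDK.

```python
import os


def load_api_key(env_var: str = "PYANNOTEAI_API_KEY") -> str:
    """Return the API key stored in `env_var` (hypothetical variable name).

    Raises RuntimeError if the variable is unset or blank, so a missing key
    is reported before any request is sent.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your pyannoteAI API key."
        )
    return key
```

The returned string can then be passed as the `token` argument of `Pipeline.from_pretrained` in place of a hard-coded value.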

Usage

# initialize speaker diarization pipeline
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained(
    "pyannote/speaker-diarization-precision-2",
    token="{pyannoteAI-api-key}")

# run speaker diarization on pyannoteAI cloud
output = pipeline("/path/to/audio.wav")

# enjoy state-of-the-art speaker diarization
for turn, speaker in output.speaker_diarization:
    print(f"start={turn.start:.1f}s stop={turn.end:.1f}s speaker_{speaker}")
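
Once the speaker turns are available, a common next step is to total the speaking time per speaker. The sketch below works on plain `(start, end, speaker)` tuples so that it runs without calling the cloud; the helper name `speaking_time` and the example turns are made up for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def speaking_time(turns: Iterable[Tuple[float, float, str]]) -> Dict[str, float]:
    """Sum the duration (in seconds) of all turns for each speaker label."""
    totals: Dict[str, float] = defaultdict(float)
    for start, end, speaker in turns:
        totals[speaker] += end - start
    return dict(totals)


# Hypothetical turns, not real pipeline output
example_turns = [(0.0, 2.5, "00"), (2.5, 4.0, "01"), (4.0, 7.0, "00")]
for speaker, seconds in speaking_time(example_turns).items():
    print(f"speaker_{speaker}: {seconds:.1f}s")
```

With real pipeline output, the input list could be built as `[(turn.start, turn.end, speaker) for turn, speaker in output.speaker_diarization]`, following the loop shown in the usage example.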