Add pyannote orchestration by EduardoPach · Pull Request #92 · argmaxinc/OpenBench

EduardoPach · 2026-01-13T18:32:58Z

What does this PR do?

This PR introduces a reusable PyannoteAI engine and adds support for PyannoteAI's new STT orchestration feature, which combines speaker diarization with transcription in a single API call.

Changes

New Engine (src/openbench/engine/pyannote_engine.py)

Created PyannoteAIApi engine class that supports both diarization-only and diarization+transcription modes
Added response models for API outputs:
- PyannoteApiDiarizationOutput - diarization-only responses
- PyannoteApiOrchestrationOutput - responses with wordLevelTranscription and turnLevelTranscription
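For readers who want a feel for how the two response models relate, here is a minimal sketch using plain dataclasses. The class names ending in `Sketch`, the `DiarizationSegment` fields, the `diarization` key, and the `parse_output` helper are assumptions made for illustration. Only the `wordLevelTranscription` and `turnLevelTranscription` field names come from this PR; the actual models in `pyannote_engine.py` may be shaped differently.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class DiarizationSegment:
    # Hypothetical segment shape: who spoke, and when (in seconds).
    speaker: str
    start: float
    end: float


@dataclass
class DiarizationOutputSketch:
    # Illustrative stand-in for PyannoteApiDiarizationOutput (diarization only).
    diarization: list[DiarizationSegment] = field(default_factory=list)


@dataclass
class OrchestrationOutputSketch(DiarizationOutputSketch):
    # Illustrative stand-in for PyannoteApiOrchestrationOutput: it adds the
    # two transcription fields named in the PR description.
    wordLevelTranscription: list[dict[str, Any]] = field(default_factory=list)
    turnLevelTranscription: list[dict[str, Any]] = field(default_factory=list)


def parse_output(payload: dict[str, Any], transcription: bool) -> DiarizationOutputSketch:
    """Build the model that matches the requested mode (hypothetical helper)."""
    segments = [DiarizationSegment(**seg) for seg in payload.get("diarization", [])]
    if not transcription:
        return DiarizationOutputSketch(diarization=segments)
    return OrchestrationOutputSketch(
        diarization=segments,
        wordLevelTranscription=payload.get("wordLevelTranscription", []),
        turnLevelTranscription=payload.get("turnLevelTranscription", []),
    )
```

Making the orchestration model a subclass is one possible design choice: code that only needs diarization segments can accept either output type.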

Refactored Diarization Pipeline

Updated PyannoteApiPipeline to use the new engine instead of duplicating API logic

New Pipelines

PyannoteTranscriptionPipeline - Uses PyannoteAI with STT but ignores speaker attribution (for transcription-only datasets)
PyannoteOrchestrationPipeline - Uses PyannoteAI with STT and includes speaker attribution
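The difference between the two pipelines can be illustrated with a small sketch: both consume the same word-level result, and only one keeps the speaker labels. The `Word` shape and the `to_transcript` helper are hypothetical and are not taken from the PR's code.

```python
from typing import Optional, TypedDict


class Word(TypedDict):
    # Hypothetical word entry; the real API's field names may differ.
    text: str
    speaker: str


def to_transcript(
    words: list[Word], keep_speakers: bool
) -> tuple[list[str], Optional[list[str]]]:
    """Return the word texts, plus per-word speakers only when requested."""
    texts = [w["text"] for w in words]
    # Transcription-only mode drops attribution; orchestration mode keeps it.
    speakers = [w["speaker"] for w in words] if keep_speakers else None
    return texts, speakers
```

Calling `to_transcript(words, keep_speakers=False)` corresponds to the transcription-only behavior, and `keep_speakers=True` to the orchestration behavior.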

Pipeline Aliases

pyannote-transcription - Transcription without speaker labels
pyannote-orchestration - Full diarization + transcription with speaker attribution
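As a rough mental model, an alias is a short name that resolves to a pipeline. The sketch below uses a plain dictionary and a hypothetical `resolve_alias` helper; it assumes each alias maps to the similarly named pipeline class and does not reflect how OpenBench actually registers aliases.

```python
# Assumed mapping from CLI alias to pipeline class name (illustration only).
PIPELINE_ALIASES: dict[str, str] = {
    "pyannote-transcription": "PyannoteTranscriptionPipeline",
    "pyannote-orchestration": "PyannoteOrchestrationPipeline",
}


def resolve_alias(alias: str) -> str:
    """Look up the pipeline class name for an alias (hypothetical helper)."""
    try:
        return PIPELINE_ALIASES[alias]
    except KeyError:
        known = ", ".join(sorted(PIPELINE_ALIASES))
        raise ValueError(f"Unknown pipeline alias {alias!r}; known aliases: {known}") from None
```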

Usage

Transcription only (no speaker labels)

openbench-cli evaluate -p pyannote-transcription -d <dataset-name> -m wer

Orchestration (with speaker labels)

openbench-cli evaluate -p pyannote-orchestration -d <dataset-name> -m wer -m cpwer -m wder
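When scripting several evaluations, the commands above can be assembled programmatically. This sketch uses only the flags shown in this PR (`-p`, `-d`, `-m`); `build_evaluate_command` is a hypothetical helper and `my-dataset` is a placeholder for a real dataset name.

```python
import shlex


def build_evaluate_command(pipeline: str, dataset: str, metrics: list[str]) -> list[str]:
    """Assemble an openbench-cli evaluate invocation as an argv list."""
    cmd = ["openbench-cli", "evaluate", "-p", pipeline, "-d", dataset]
    # The CLI examples repeat -m once per metric, so do the same here.
    for metric in metrics:
        cmd += ["-m", metric]
    return cmd


# "my-dataset" is a placeholder, not a dataset shipped with OpenBench.
cmd = build_evaluate_command("pyannote-orchestration", "my-dataset", ["wer", "cpwer", "wder"])
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` to launch the evaluation, assuming `openbench-cli` is installed and on the `PATH`.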

dbrkn

lgtm

add: pyannote orchestration

6625b62

EduardoPach requested review from atiorh and dbrkn January 13, 2026 18:33

fix: year in copyright

6a26b20

EduardoPach changed the title from "add: pyannote orchestration" to "Add pyannote orchestration" Jan 13, 2026

Merge remote-tracking branch 'origin/main' into eduardo/pyannote-orchestration

6bc5705

dbrkn approved these changes Jan 19, 2026

EduardoPach merged commit 18c7372 into main Jan 19, 2026
2 checks passed

EduardoPach deleted the eduardo/pyannote-orchestration branch January 19, 2026 14:29

EduardoPach merged 3 commits into main from eduardo/pyannote-orchestration

2 participants
