Skip to content

--diarize flag no longer works with stereo input in latest release #3092

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
varlog11 opened this issue Apr 29, 2025 · 1 comment
Open

--diarize flag no longer works with stereo input in latest release #3092

varlog11 opened this issue Apr 29, 2025 · 1 comment

Comments

@varlog11
Copy link

varlog11 commented Apr 29, 2025

Environment:
whisper.cpp version: latest (1.7.5)
Platform: Debian 12, x86_64
Build flags: -D WHISPER_FFMPEG=yes -D GGML_CUDA=no
Test file: 2-channel stereo WAV with two clean channels

In version 1.7.4, the --diarize option correctly performed voice diarization on a stereo input file (splitting tracks by speaker/channel). After upgrading to the latest release, the same command no longer produces separate speaker outputs.

@danbev
Copy link
Collaborator

danbev commented Apr 30, 2025

Could you provide a sample audio file that I can use to reproduce this, and also what model are you using?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants