Profile-Error-Tolerant Target-Speaker Voice Activity Detection

By Dongmei Wang and others
Target-Speaker Voice Activity Detection (TS-VAD) utilizes a set of speaker profiles alongside an input audio signal to perform speaker diarization. While its superiority over conventional methods has been demonstrated, the method can suffer from errors in speaker profiles, as those profiles are typically obtained by running a traditional clustering-based diarization...
April 3, 2024