Sign in

Attention-based conditioning methods using variable frame rate for style-robust speaker verification

By Amber Afshan and Abeer Alwan
We propose an approach to extract speaker embeddings that are robust to speaking style variations in text-independent speaker verification. Typically, speaker embedding extraction includes training a DNN for speaker classification and using the bottleneck features as speaker representations. Such a network has a pooling layer to transform frame-level to utterance-level... Show more
June 28, 2022
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Attention-based conditioning methods using variable frame rate for style-robust speaker verification
Click on play to start listening