Sign in

Global Normalization for Streaming Speech Recognition in a Modular Framework

By Ehsan Variani and others
We introduce the Globally Normalized Autoregressive Transducer (GNAT) for addressing the label bias problem in streaming speech recognition. Our solution admits a tractable exact computation of the denominator for the sequence-level normalization. Through theoretical and empirical results, we demonstrate that by switching to a globally normalized model, the word error... Show more
May 26, 2022
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Global Normalization for Streaming Speech Recognition in a Modular Framework
Click on play to start listening