Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

By Qian Zhang and others
In this paper, we present an end-to-end speech recognition model with Transformer encoders that can be used in a streaming speech recognition system. Transformer computation blocks based on self-attention are used to encode both audio and label sequences independently. The activations from both audio and label encoders are combined with...
February 14, 2020