Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget

By Lukas Drude and others
Automatic speech recognition (ASR) in the cloud allows the use of larger models and more powerful multi-channel signal processing front-ends than on-device processing. However, it also adds inherent latency due to the transmission of the audio signal, especially when transmitting multiple channels of a microphone array. One way...
June 15, 2021