Sign in

Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition

By Zili Huang and others
Self-supervised learning (SSL), which utilizes the input data itself for representation learning, has achieved state-of-the-art results for various downstream speech tasks. However, most of the previous studies focused on offline single-talker applications, with limited investigations in multi-talker cases, especially for streaming scenarios. In this paper, we investigate SSL for streaming... Show more
November 10, 2022
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
Click on play to start listening