An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

By Xiaofei Wang and others at Microsoft Research
Recently, zero-shot text-to-speech (TTS) systems, capable of synthesizing any speaker's voice from a short audio prompt, have advanced rapidly. However, the quality of the generated speech deteriorates significantly when the audio prompt contains noise, and limited research has addressed this issue. In this paper, we explored...
June 9, 2024