Synthical
Naoyuki Kanda
Articles
7
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
17 September 2024 by Haibin Wu and others at National Taiwan University
Audio and Speech Processing, Artificial Intelligence
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
12 September 2024 by Sefik Emre Eskimez and others
Audio and Speech Processing, Sound
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
25 June 2024 by Xiaofei Wang and others
Audio and Speech Processing, Computation and Language
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS
9 June 2024 by Xiaofei Wang and others at Microsoft Research
Audio and Speech Processing, Artificial Intelligence
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
6 June 2024 by Sefik Emre Eskimez and others
Audio and Speech Processing
Constraints on dark photon dark matter using data from LIGO's and Virgo's third observing run
6 May 2024 by LIGO Scientific Collaboration and others
Cosmology and Nongalactic Astrophysics, General Relativity and Quantum Cosmology
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
3 April 2024 by Dongmei Wang and others
Sound, Audio and Speech Processing
Development of advanced photon calibrator for Kamioka gravitational wave detector (KAGRA)
30 March 2024 by Yoshiyuki Inoue and others
Instrumentation and Methods for Astrophysics, Instrumentation and Detectors
Ultralight vector dark matter search using data from the KAGRA O3GK run
5 March 2024 by LIGO Scientific Collaboration and others
Cosmology and Nongalactic Astrophysics, General Relativity and Quantum Cosmology
Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like
4 March 2024 by Naoyuki Kanda and others
Audio and Speech Processing, Computation and Language
DiariST: Streaming Speech Translation with Speaker Diarization
22 January 2024 by Mu Yang and others at University of Texas at Dallas
Audio and Speech Processing, Computation and Language
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription
16 January 2024 by Alon Vinnikov and others at Microsoft Research
Sound, Artificial Intelligence