Synthical
Articles by
Naoyuki Kanda
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
Yesterday by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
,
Sound
Observation of Gravitational Waves from the Coalescence of a $2.5\text{-}4.5~M_\odot$ Compact Object and a Neutron Star
26 July 2024 by
LIGO Scientific Collaboration
and
others
High Energy Astrophysical Phenomena
,
General Relativity and Quantum Cosmology
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
17 July 2024 by
Haibin Wu
and
others
at
National Taiwan University
Audio and Speech Processing
,
Artificial Intelligence
Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run
13 July 2024 by
Gayathri Raman
and
others
High Energy Astrophysical Phenomena
,
General Relativity and Quantum Cosmology
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
25 June 2024 by
Xiaofei Wang
and
others
Audio and Speech Processing
,
Computation and Language
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS
9 June 2024 by
Xiaofei Wang
and
others
at
Microsoft Research
Audio and Speech Processing
,
Artificial Intelligence
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
6 June 2024 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
Constraints on dark photon dark matter using data from LIGO's and Virgo's third observing run
6 May 2024 by
LIGO Scientific Collaboration
and
others
Cosmology and Nongalactic Astrophysics
,
General Relativity and Quantum Cosmology
Profile-Error-Tolerant Target-Speaker Voice Activity Detection
3 April 2024 by
Dongmei Wang
and
others
Sound
,
Audio and Speech Processing
Development of advanced photon calibrator for Kamioka gravitational wave detector (KAGRA)
30 March 2024 by
Yusuke Inoue
and
others
Instrumentation and Methods for Astrophysics
,
Instrumentation and Detectors
Ultralight vector dark matter search using data from the KAGRA O3GK run
5 March 2024 by
LIGO Scientific Collaboration
and
others
Cosmology and Nongalactic Astrophysics
,
General Relativity and Quantum Cosmology
Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like
4 March 2024 by
Naoyuki Kanda
and
others
Audio and Speech Processing
,
Computation and Language
Search for subsolar-mass black hole binaries in the second part of Advanced LIGO's and Advanced Virgo's third observing run
26 January 2024 by
LIGO Scientific Collaboration
and
others
High Energy Astrophysical Phenomena
,
Cosmology and Nongalactic Astrophysics
DiariST: Streaming Speech Translation with Speaker Diarization
22 January 2024 by
Mu Yang
and
others
at
University of Texas at Dallas
Audio and Speech Processing
,
Computation and Language
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription
16 January 2024 by
Alon Vinnikov
and
others
at
Microsoft Research
Sound
,
Artificial Intelligence
GWTC-3: Compact Binary Coalescences Observed by LIGO and Virgo During the Second Part of the Third Observing Run
23 October 2023 by
LIGO Scientific Collaboration
and
others
General Relativity and Quantum Cosmology
,
High Energy Astrophysical Phenomena
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation
23 October 2023 by
Sara Papi
and
others
Computation and Language
,
Sound
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability
15 September 2023 by
Jian Wu
and
others
Audio and Speech Processing
,
Sound
A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run
25 August 2023 by
Calvin Fletcher
and
others
High Energy Astrophysical Phenomena
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
7 August 2023 by
LIGO Scientific Collaboration
and
others
High Energy Astrophysical Phenomena
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts
31 May 2023 by
Xiaofei Wang
and
others
Audio and Speech Processing
,
Sound
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
30 May 2023 by
Chenda Li
and
others
Audio and Speech Processing
,
Computation and Language
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
21 May 2023 by
Ziyi Yang
and
others
Computation and Language
,
Artificial Intelligence
Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network
17 April 2023 by
LIGO Scientific Collaboration
and
others
General Relativity and Quantum Cosmology
,
Cosmology and Nongalactic Astrophysics
Factual Consistency Oriented Speech Recognition
24 February 2023 by
Naoyuki Kanda
and
others
Audio and Speech Processing
,
Computation and Language
Open data from the third observing run of LIGO, Virgo, KAGRA and GEO
7 February 2023 by
LIGO Scientific Collaboration
and
others
General Relativity and Quantum Cosmology
Model-based cross-correlation search for gravitational waves from the low-mass X-ray binary Scorpius X-1 in LIGO O3 data
2 January 2023 by
LIGO Scientific Collaboration
and
others
High Energy Astrophysical Phenomena
,
General Relativity and Quantum Cosmology
Speech separation with large-scale self-supervised learning
26 November 2022 by
Zhuo Chen
and
others
Audio and Speech Processing
,
Computation and Language
Simulating realistic speech overlaps improves multi-talker ASR
17 November 2022 by
Muqiao Yang
and
others
Audio and Speech Processing
,
Computation and Language
Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition
10 November 2022 by
Zili Huang
and
others
Audio and Speech Processing
,
Sound