Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Claim page
Sefik Emre Eskimez
Follow
Activity
Upvotes
Folders
Articles
23
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech
17 September 2024 by
Haibin Wu
and
others
at
National Taiwan University
Audio and Speech Processing
,
Artificial Intelligence
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS
12 September 2024 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
,
Sound
Target conversation extraction: Source separation using turn-taking dynamics
29 July 2024 by
Tuochao Chen
and
others
Computation and Language
,
Audio and Speech Processing
Knowledge boosting during low-latency inference
25 July 2024 by
Vidya Srinivas
and
others
at
Microsoft Research
Machine Learning
,
Sound
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
25 June 2024 by
Xiaofei Wang
and
others
Audio and Speech Processing
,
Computation and Language
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS
9 June 2024 by
Xiaofei Wang
and
others
at
Microsoft Research
Audio and Speech Processing
,
Artificial Intelligence
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems
6 June 2024 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like
4 March 2024 by
Naoyuki Kanda
and
others
Audio and Speech Processing
,
Computation and Language
Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation
25 May 2023 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
,
Sound
Real-Time Audio-Visual End-to-End Speech Enhancement
13 March 2023 by
Zirun Zhu
and
others
Audio and Speech Processing
Speech separation with large-scale self-supervised learning
26 November 2022 by
Zhuo Chen
and
others
Audio and Speech Processing
,
Computation and Language
Breaking the trade-off in personalized speech enhancement with cross-task knowledge distillation
5 November 2022 by
Hassan Taherian
and
others
Audio and Speech Processing
,
Sound
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation
7 April 2022 by
Xiaofei Wang
and
others
Audio and Speech Processing
,
Artificial Intelligence
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation
2 April 2022 by
Manthan Thakker
and
others
Audio and Speech Processing
,
Sound
ICASSP 2022 Deep Noise Suppression Challenge
27 February 2022 by
Harishchandra Dubey
and
others
Audio and Speech Processing
,
Sound
Sequence-level self-learning with multiple hypotheses
10 December 2021 by
Kenichi Kumatani
and
others
Computation and Language
,
Artificial Intelligence
Separating Long-Form Speech with Group-Wise Permutation Invariant Training
17 November 2021 by
Wangyou Zhang
and
others
Audio and Speech Processing
,
Sound
One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
20 October 2021 by
Hassan Taherian
and
others
Audio and Speech Processing
,
Sound
Personalized Speech Enhancement: New Models and Comprehensive Evaluation
18 October 2021 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
,
Machine Learning
All-neural beamformer for continuous speech separation
13 October 2021 by
Zhuohuang Zhang
and
others
Audio and Speech Processing
,
Sound
Speech Driven Talking Face Generation from a Single Image and an Emotion Condition
21 July 2021 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
,
Computer Vision and Pattern Recognition
Dynamic Gradient Aggregation for Federated Domain Adaptation
14 June 2021 by
Dimitrios Dimitriadis
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
5 June 2021 by
Sefik Emre Eskimez
and
others
Audio and Speech Processing
Federated Transfer Learning with Dynamic Gradient Aggregation
6 August 2020 by
Dimitrios Dimitriadis
and
others
Machine Learning
,
Distributed, Parallel, and Cluster Computing
Improving Readability for Automatic Speech Recognition Transcription
9 April 2020 by
Junwei Liao
and
others
Computation and Language
Generating Talking Face Landmarks from Speech
23 April 2018 by
Sefik Emre Eskimez
and
others
Computer Vision and Pattern Recognition
Emotion Classification: How Does an Automated System Compare to Naive Human Coders?
22 January 2016 by
Sefik Emre Eskimez
and
others
Human-Computer Interaction
This is an AI-generated summary
Key points
Topics
Audio and Speech Processing
Sound
Machine Learning
Computation and Language
Artificial Intelligence
Signal Processing
Computer Vision and Pattern Recognition
Distributed, Parallel, and Cluster Computing
Multimedia
Human-Computer Interaction