Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Simple
Original
Articles about
Sound
Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis
Today by
Ziyue Jiang
and
others
Audio and Speech Processing
,
Machine Learning
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Yesterday by
Yan-Bo Lin
and
others
Computer Vision and Pattern Recognition
,
Machine Learning
Benchmarking Machine Learning Methods for Distributed Acoustic Sensing
Yesterday by
Shuaikai Shi
and
Qijun Zong
Audio and Speech Processing
,
Computer Vision and Pattern Recognition
FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System
Yesterday by
Hao-Han Guo
and
others
Sound
,
Audio and Speech Processing
QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions
Yesterday by
Siyin Wang
and
others
Audio and Speech Processing
,
Artificial Intelligence
Qwen2.5-Omni Technical Report
Yesterday by
Jin Xu
and
others
Computation and Language
,
Computer Vision and Pattern Recognition
Quantification of Tenseness in English and Japanese Tense-Lax Vowels: A Lagrangian Model with Indicator θ1 and Force of Tenseness Ftense(t)
2 days ago by
Tatsuya Ishizaki
Computation and Language
,
Sound
Deep Learning for Speech Emotion Recognition: A CNN Approach Utilizing Mel Spectrograms
2 days ago by
Niketa Penumajji
Sound
,
Artificial Intelligence
A Comprehensive Survey with Critical Analysis for Deepfake Speech Detection
2 days ago by
Lam Pham
and
others
Sound
,
Audio and Speech Processing
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation
2 days ago by
Max W. Y. Lam
and
others
Sound
,
Artificial Intelligence
A Generalist Audio Foundation Model for Comprehensive Body Sound Auscultation
2 days ago by
Pingjie Wang
and
others
Sound
,
Audio and Speech Processing
Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization
2 days ago by
Weifei Jin
and
others
Sound
,
Cryptography and Security
Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
2 days ago by
Jean-Marie Lemercier
and
others
Audio and Speech Processing
,
Machine Learning
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
3 days ago by
Jakob Abeßer
and
others
Audio and Speech Processing
,
Sound
SpeechVerse: A Large-scale Generalizable Audio Language Model
3 days ago by
Nilaksh Das
and
others
Computation and Language
,
Sound
A Reliable and Efficient Detection Pipeline for Rodent Ultrasonic Vocalizations
3 days ago by
Sabah Shahnoor Anis
and
others
Sound
,
Audio and Speech Processing
Seeing Speech and Sound: Distinguishing and Locating Audios in Visual Scenes
3 days ago by
Hyeonggon Ryu
and
others
Computer Vision and Pattern Recognition
,
Sound
CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research
3 days ago by
Monan Zhou
and
others
Information Retrieval
,
Sound
Wireless Hearables With Programmable Speech AI Accelerators
3 days ago by
Malek Itani
and
others
Sound
,
Audio and Speech Processing
Music Similarity Representation Learning Focusing on Individual Instruments with Source Separation and Human Preference
3 days ago by
Takehiro Imamura
and
others
Sound
,
Audio and Speech Processing
STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment
3 days ago by
Yong Ren
and
others
Sound
,
Multimedia
MusicEval: A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation
4 days ago by
Cheng Liu
and
others
Sound
,
Audio and Speech Processing
DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation
4 days ago by
Peng Chen
and
others
Computer Vision and Pattern Recognition
,
Artificial Intelligence
Machine learning based animal emotion classification using audio signals
4 days ago by
Mariia Slobodian
and
Mykola Kozlenko
Sound
,
Machine Learning
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques
4 days ago by
Yuanchao Li
and
others
at
University of Edinburgh
Audio and Speech Processing
,
Computation and Language
Anomaly Detection and Localization for Speech Deepfakes via Feature Pyramid Matching
4 days ago by
Emma Coletta
and
others
Sound
,
Computer Vision and Pattern Recognition
A State-of-the-Art Review on Acoustic Preservation of Historical Worship Spaces through Auralization
4 days ago by
Hannes Rosseel
and
Toon Van Waterschoot
Audio and Speech Processing
,
Sound
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
4 days ago by
Mohammad Mahdi Abootorabi
and
Ehsaneddin Asgari
Computation and Language
,
Information Retrieval
Heterogeneous bimodal attention fusion for speech emotion recognition
4 days ago by
Jiachen Luo
and
others
Sound
,
Artificial Intelligence
Elevating Robust Multi-Talker ASR by Decoupling Speaker Separation and Speech Recognition
5 days ago by
Yufeng Yang
and
others
Sound
,
Audio and Speech Processing
Load more