Synthical
Articles by
Ankur Bapna
STAB: Speech Tokenizer Assessment Benchmark
3 days ago by
Shikhar Vashishth
and
others
Computation and Language
,
Sound
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
8 August 2024 by
Gemini Team
and
others
at
Google
Computation and Language
,
Artificial Intelligence
Gemini: A Family of Highly Capable Multimodal Models
20 May 2024 by
Gemini Team
and
others
at
Google
Computation and Language
,
Artificial Intelligence
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
25 September 2023 by
Yu Zhang
and
others
Computation and Language
,
Sound
Multimodal Modeling For Spoken Language Identification
19 September 2023 by
Shikhar Bharadwaj
and
others
Computation and Language
,
Machine Learning
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
9 September 2023 by
Sneha Kudugunta
and
others
at
Google
Computation and Language
,
Machine Learning
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
14 August 2023 by
Yuma Koizumi
and
others
Sound
,
Machine Learning
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
27 June 2023 by
Yong Cheng
and
others
Computation and Language
,
Sound
AudioPaLM: A Large Language Model That Can Speak and Listen
22 June 2023 by
Paul Rubenstein
and
others
Computation and Language
,
Artificial Intelligence
Label Aware Speech Representation Learning For Language Identification
7 June 2023 by
Shikhar Vashishth
and
others
Computation and Language
,
Machine Learning
SQuId: Measuring Speech Naturalness in Many Languages
1 June 2023 by
Thibault Sellam
and
others
Computation and Language
,
Machine Learning
LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
30 May 2023 by
Yuma Koizumi
and
others
at
Google
Audio and Speech Processing
,
Sound
Understanding Shared Speech-Text Representations
27 April 2023 by
Gary Wang
and
others
Computation and Language
,
Machine Learning
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech
15 March 2023 by
Takaaki Saeki
and
others
Sound
,
Computation and Language
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
21 October 2022 by
Zhehuai Chen
and
others
Computation and Language
,
Sound
JOIST: A Joint Speech and Text Streaming Model For ASR
13 October 2022 by
Tara Sainath
and
others
Computation and Language
,
Sound
MAESTRO: Matched Speech Text Representations through Modality Matching
1 July 2022 by
Zhehuai Chen
and
others
Computation and Language
,
Sound
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
27 June 2022 by
Ye Jia
and
others
Computation and Language
,
Machine Learning
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
25 May 2022 by
Alexis Conneau
and
others
Computation and Language
,
Machine Learning
Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to Documents
17 May 2022 by
Biao Zhang
and
others
Computation and Language
,
Machine Learning
Building Machine Translation Systems for the Next Thousand Languages
16 May 2022 by
Ankur Bapna
and
others
Computation and Language
,
Artificial Intelligence
XTREME-S: Evaluating Cross-lingual Speech Representations
22 March 2022 by
Alexis Conneau
and
others
Computation and Language
Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation
15 March 2022 by
Yong Cheng
and
others
Computation and Language
,
Artificial Intelligence
mSLAM: Massively multilingual joint pre-training for speech and text
3 February 2022 by
Ankur Bapna
and
others
Computation and Language
,
Machine Learning
Examining Scaling and Transfer of Language Model Architectures for Machine Translation
2 February 2022 by
Biao Zhang
and
others
Computation and Language
,
Machine Learning
Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning
13 January 2022 by
Aditya Siddhant
and
others
Computation and Language
,
Machine Learning
Joint Unsupervised and Supervised Training for Multilingual ASR
15 November 2021 by
Junwen Bai
and
others
Computation and Language
,
Machine Learning
SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training
20 October 2021 by
Ankur Bapna
and
others
Computation and Language
,
Machine Learning
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference
24 September 2021 by
Sneha Kudugunta
and
others
at
Google
Computation and Language
,
Machine Learning
Scaling Laws for Neural Machine Translation
16 September 2021 by
Behrooz Ghorbani
and
others
Machine Learning
,
Artificial Intelligence