Synthical
Your space
Profile
Activity
Favorites
Folders
Feeds
All articles
Simple
Original
Articles by
James Glass
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
9 July 2024 by
Yung-Sung Chuang
and
others
Computation and Language
,
Artificial Intelligence
DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners
4 July 2024 by
Saurabhchand Bhati
and
others
at
MIT
Audio and Speech Processing
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
3 July 2024 by
Cheng-Yu Hsieh
and
others
at
University of Washington
Computation and Language
,
Artificial Intelligence
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer
26 June 2024 by
Liming Wang
and
others
Sound
,
Artificial Intelligence
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers
16 June 2024 by
Tianhua Zhang
and
others
Computation and Language
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
14 June 2024 by
Andrew Rouditchenko
and
others
at
MIT
Audio and Speech Processing
,
Computer Vision and Pattern Recognition
Self-Specialization: Uncovering Latent Expertise within Large Language Models
5 June 2024 by
Junmo Kang
and
others
at
MIT
Computation and Language
,
Artificial Intelligence
THREAD: Thinking Deeper with Recursive Spawning
27 May 2024 by
Philip Schroeder
and
others
Computation and Language
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces
1 April 2024 by
Heng-Jui Chang
and
James Glass
Computation and Language
,
Sound
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
11 March 2024 by
Yung-Sung Chuang
and
others
Computation and Language
,
Artificial Intelligence
Curiosity-driven Red-teaming for Large Language Models
29 February 2024 by
Zhang-Wei Hong
and
others
Machine Learning
,
Artificial Intelligence
Listen, Think, and Understand
1
19 February 2024 by
Yuan Gong
and
others
at
MIT
Audio and Speech Processing
,
Sound
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
25 January 2024 by
Sameer Khurana
and
others
Computation and Language
,
Artificial Intelligence
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective
16 January 2024 by
Alexander Liu
and
others
Audio and Speech Processing
,
Computation and Language
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
1 January 2024 by
Brian Chen
and
others
Computer Vision and Pattern Recognition
,
Artificial Intelligence
Joint Audio and Speech Understanding
10 December 2023 by
Yuan Gong
and
others
Sound
,
Artificial Intelligence
Audio-Visual Neural Syntax Acquisition
11 October 2023 by
Cheng-I Jeff Lai
and
others
Computation and Language
,
Machine Learning
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
19 September 2023 by
Tianhua Zhang
and
others
Computation and Language
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
6 July 2023 by
Yuan Gong
and
others
Sound
,
Audio and Speech Processing
SAIL: Search-Augmented Instruction Learning
25 June 2023 by
Hongyin Luo
and
others
Computation and Language
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS
13 June 2023 by
Cheng-Han Chiang
and
others
Computation and Language
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
31 May 2023 by
Andrew Rouditchenko
and
others
Computation and Language
,
Sound
Entailment as Robust Self-Learner
26 May 2023 by
Jiaxin Ge
and
others
Computation and Language
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
26 May 2023 by
Yung-Sung Chuang
and
others
Computation and Language
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering
18 May 2023 by
Heng-Jui Chang
and
others
Computation and Language
,
Audio and Speech Processing
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval
9 May 2023 by
Andrew Rouditchenko
and
others
Computation and Language
,
Computer Vision and Pattern Recognition
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation
5 May 2023 by
Tianxing He
and
others
Computation and Language
Interpretable Unified Language Checking
7 April 2023 by
Tianhua Zhang
and
others
Computation and Language
What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
29 March 2023 by
Brian Chen
and
others
Computer Vision and Pattern Recognition
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
10 March 2023 by
Hongyin Luo
and
James Glass
Computation and Language
,
Artificial Intelligence
Load more