Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning

Dongseong Hwang,Ananya Misra,Zhouyuan Huo,Nikhil Siddhartha,Shefali Garg,David Qiu,Khe Chai Sim,Trevor Strohman,Françoise Beaufays,Yanzhang He

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)（2022）

Cited 32|Views62

No score

Abstract

Self- and semi-supervised learning methods have been actively investigated to reduce labeled training data or enhance model performance. However, these approaches mostly focus on in-domain performance for public datasets. In this study, we utilize the combination of self- and semi-supervised learning methods to solve unseen domain adaptation problems in a large-scale production setting for online ASR model. This approach demonstrates that using the source domain data with a small fraction of the target domain data (3%) can recover the performance gap compared to a full data baseline: 13.5% relative WER improvement for target domain data.

Translated text

Key words

speech recognition,domain adaptation,self-supervised learning,semi-supervised learning,RNN-T

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Chat Paper

Summary is being generated by the instructions you defined