Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)(2022)

Cited 32|Views62
No score
Abstract
Self- and semi-supervised learning methods have been actively investigated to reduce labeled training data or enhance model performance. However, these approaches mostly focus on in-domain performance for public datasets. In this study, we utilize the combination of self- and semi-supervised learning methods to solve unseen domain adaptation problems in a large-scale production setting for online ASR model. This approach demonstrates that using the source domain data with a small fraction of the target domain data (3%) can recover the performance gap compared to a full data baseline: 13.5% relative WER improvement for target domain data.
More
Translated text
Key words
speech recognition,domain adaptation,self-supervised learning,semi-supervised learning,RNN-T
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined