Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)(2022)
Abstract
Self- and semi-supervised learning methods have been actively investigated to reduce labeled training data or enhance model performance. However, these approaches mostly focus on in-domain performance for public datasets. In this study, we utilize the combination of self- and semi-supervised learning methods to solve unseen domain adaptation problems in a large-scale production setting for online ASR model. This approach demonstrates that using the source domain data with a small fraction of the target domain data (3%) can recover the performance gap compared to a full data baseline: 13.5% relative WER improvement for target domain data.
MoreTranslated text
Key words
speech recognition,domain adaptation,self-supervised learning,semi-supervised learning,RNN-T
AI Read Science
Must-Reading Tree
Example
![](https://originalfileserver.aminer.cn/sys/aminer/pubs/mrt_preview.jpeg)
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined