Selection And Combination Of Hypotheses For Dialectal Speech Recognition

Victor Soto,Olivier Siohan,Mohamed Elfeky,Pedro Moreno

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)（2016）

引用 22|浏览60

暂无评分

摘要

While research has often shown that building dialect-specific Automatic Speech Recognizers is the optimal approach to dealing with dialectal variations of the same language, we have observed that dialect-specific recognizers do not always output the best recognitions. Often enough, another dialectal recognizer outputs a better recognition than the dialect-specific one. In this paper, we present two methods to select and combine the best decoded hypothesis from a pool of dialectal recognizers. We follow a Machine Learning approach and extract features from the Speech Recognition output along with Word Embeddings and use Shallow Neural Networks for classification. Our experiments using Dictation and Voice Search data from the main four Arabic dialects show good WER improvements for the hypothesis selection scheme, reducing the WER by 2.1 to 12.1% depending on the test set, and promising results for the hypotheses combination scheme.

查看译文

关键词

speech recognition,dialects,system combination,system selection

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要