Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
ICASSP, pp. 4329-4332, 2009.
data-driven lexicon expansionpronunciation variantmultiple pronunciationmandarin broadcast speech recognitiontop pronunciation variantMore(20+)
We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation of pronunciation variants for frequent words and vocabulary augmentation with new words and phrases derived from the training data. To learn multiple pronunciati...More
Full Text (Upload PDF)
PPT (Upload PPT)