Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition

ICASSP, pp. 4329-4332, 2009.

Cited by: 21|Bibtex|Views9|Links
EI
Keywords:
data-driven lexicon expansionpronunciation variantmultiple pronunciationmandarin broadcast speech recognitiontop pronunciation variantMore(20+)

Abstract:

We present a data-driven framework for expanding the lexicon to improve Mandarin broadcast news and conversation speech recognition. The lexicon expansion includes the generation of pronunciation variants for frequent words and vocabulary augmentation with new words and phrases derived from the training data. To learn multiple pronunciati...More

Code:

Data:

Your rating :
0

 

Tags
Comments