A bi-directional LSTM approach for polyphone disambiguation in Mandarin Chinese

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)(2016)

引用 22|浏览45
暂无评分
摘要
Polyphone disambiguation in Mandarin Chinese aims to pick up the correct pronunciation from several candidates for a polyphonic character. It serves as an essential component in human language technologies such as text-to-speech synthesis. Since the pronunciation for most polyphonic characters can be easily decided from their contexts in the text, in this paper, we address the polyphone disambiguation problem as a sequential labeling task. Specifically, we propose to use bidirectional long short-term memory (BLSTM) neural network to encode both the past and future observations on the character sequence as its inputs and predict the pronunciations. We also empirically study the impacts of (1) modeling different length of contexts, (2) the number of BLSTM layers and (3) the granularity of part-o-speech (POS) tags as features. Our results show that using a deep BLSTM is able to achieve state-of-the-art performance in polyphone disambiguation.
更多
查看译文
关键词
Polyphone disambiguation,Grapheme-to-phoneme conversion,Sequence tagging,Bi-directional LSTM,Text-to-Speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要