Using N-best lists for Named Entity Recognition from Chinese Speech.

HLT-NAACL-Short '04: Proceedings of HLT-NAACL 2004: Short Papers (2004)

Abstract
We present the first known result for named entity recognition (NER) in realistic large-vocabulary spoken Chinese. We establish this result by applying a maximum entropy model, currently the single best known approach for textual Chinese NER, to the recognition output of the BBN LVCSR system on Chinese Broadcast News utterances. Our results support the claim that transferring NER approaches from text to spoken language is a significantly more difficult task for Chinese than for English. We propose re-segmenting the ASR hypotheses as well as applying post-classification to improve performance. Finally, we introduce a method of using n-best hypotheses that yields a small but nevertheless useful improvement in NER accuracy. We use acoustic, phonetic, language model, NER and other scores as confidence measures. Experimental results show an average of 6.7% relative improvement in precision and 1.7% relative improvement in F-measure.
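The abstract does not say how the individual scores are combined when choosing among n-best hypotheses. As a rough illustration only, the sketch below assumes a simple weighted sum over per-hypothesis scores; the `Hypothesis` class, the score names, the weights, and the example values are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Hypothesis:
    """One ASR hypothesis from an n-best list (hypothetical structure)."""
    text: str
    # Per-source scores, e.g. log-probabilities; names are assumptions.
    scores: Dict[str, float]


def combined_score(hyp: Hypothesis, weights: Dict[str, float]) -> float:
    """Weighted sum of the scores; sources without a weight contribute 0."""
    return sum(weights.get(name, 0.0) * value
               for name, value in hyp.scores.items())


def pick_best(nbest: List[Hypothesis], weights: Dict[str, float]) -> Hypothesis:
    """Return the hypothesis with the highest combined score."""
    if not nbest:
        raise ValueError("n-best list is empty")
    return max(nbest, key=lambda h: combined_score(h, weights))


# Made-up scores for two competing hypotheses.
nbest = [
    Hypothesis("hyp_a", {"acoustic": -10.0, "lm": -5.0, "ner": -2.0}),
    Hypothesis("hyp_b", {"acoustic": -11.0, "lm": -4.0, "ner": -0.5}),
]
# Assumed weights; in practice they would be tuned on held-out data.
weights = {"acoustic": 1.0, "lm": 1.0, "ner": 2.0}
best = pick_best(nbest, weights)
```

With these made-up numbers the NER score tips the choice toward the second hypothesis even though its acoustic score is lower, which is the kind of effect a confidence-based selection over n-best lists is meant to capture.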
Keywords
relative improvement, NER approach, textual Chinese NER, useful improvement NER accuracy, Chinese Broadcast News utterance, entity recognition, experimental result, known result, language model, maximum entropy model, Chinese speech, N-best list