Leveraging Social Annotation For Topic Language Model Adaptation

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3(2012)

引用 24|浏览13
暂无评分
摘要
Social annotations such as Yahoo! Answers(1) already define broad coverage of hierarchical topic categories and include millions of documents annotated by web users. For topic language model (LM) adaptation, we present a novel approach of performing topic segmentation of LM training corpus via leveraging such social annotations, which may be more effective than unsupervised methods. Experimental results on the IWSLT-2011 TED ASR data sets demonstrate that we can achieve modest improvements when compared with the unsupervised methods such as clustering-based and LDA-based algorithms, e.g., the absolute WER improvement over the LDA-based method on the test set reaches 0.5 points.
更多
查看译文
关键词
Topic language model adaptation,social annotations,speech recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要