Multi-speaker language modeling

HLT-NAACL (Short Papers)(2004)

引用 36|浏览9
暂无评分
摘要
In conventional language modeling, the words from only one speaker at a time are represented, even for conversational tasks such as meetings and telephone calls. In a conversational or meeting setting, however, speakers can have significant influence on each other. To recover such un-modeled inter-speaker information, we introduce an approach for conversational language modeling that considers words from other speakers when predicting words from the current one. By augmenting a normal trigram context, our new multi-speaker language model (MSLM) improves on both Switchboard and ICSI Meeting Recorder corpora. Using an MSLM and a conditional mutual information based word clustering algorithm, we achieve a 8.9% perplexity reduction on Switchboard and a 12.2% reduction on the ICSI Meeting Recorder data.
更多
查看译文
关键词
conversational language modeling,meeting setting,recorder data,perplexity reduction,conversational task,conventional language modeling,un-modeled inter-speaker information,new multi-speaker language model,multi-speaker language modeling,conditional mutual information,recorder corpus
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要