Joint N-Best Rescoring For Repeated Utterances In Spoken Dialog Systems

2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS(2008)

引用 3|浏览52
暂无评分
摘要
Due to speech recognition errors, repetitions are a frequent phenomenon in spoken dialog systems. In previous work [1] we have proposed a joint decoding model that can leverage structural relationships between repeated utterances for improving recognition performance. In this paper we extend this work in two directions. First, we propose a direct, classification-based model for the same task. The new model can leverage features that were fundamentally hard to capture in the previous framework (e.g. spellings, false-starts, etc.) and leads to an additional performance improvement. Second, we show how both models can be used to perform a combined rescoring of two n-best lists that are part of a repetition pair.
更多
查看译文
关键词
speech recognition,repetitions,rescoring
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要