The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues.

LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (2014)

Abstract
The paper describes a project for continuous data collection for a spoken dialogue system engaged in Question-Answering interactions in English. The Wizard-of-Oz method used in the bootstrap phase is presented, and several types of resulting dialogue annotations are described. The resulting corpus will be publicly released.
Keywords
continuous dialogue data collection, Wizard-of-Oz experiments, semantic annotations