Story segmentation of broadcast news in Arabic, Chinese and English using multi-window features.

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval(2007)

引用 7|浏览28
暂无评分
摘要
The paper describes a maximum entropy based story segmentation system for Arabic, Chinese and English. In experiments with broadcast news data from TDT-3, TDT-4, and corpora collected in the DARPA GALE project we obtain a substantial performance gain using multiple overlapping windows for text-based features.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要