Mining script-like structures from the web

FAM-LbR '10: Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading (2010)

Abstract
This paper presents preliminary work on extracting script-like structures, called events and event sets, from collections of web documents. Unlike existing methods, our approach is topic-driven: event sets are extracted for a specified topic. We introduce an iterative system architecture and present methods for reducing noise in web corpora. Preliminary results show that LSA-based event relatedness yields better event sets from web corpora than previous methods.
Keywords
event set, web corpus, LSA-based event relatedness yield, better event set, web document, preliminary result, preliminary work, iterative system architecture, noise problem, present method, mining script-like structure