Mining script-like structures from the web

Niels Kasch, Tim Oates

FAM-LbR '10: Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading (2010)

Abstract

This paper presents preliminary work on extracting script-like structures, called events and event sets, from collections of web documents. Unlike existing methods, our approach is topic-driven: event sets are extracted for a specified topic. We introduce an iterative system architecture and present methods to reduce noise in web corpora. Preliminary results show that LSA-based event relatedness yields better event sets from web corpora than previous methods.

Keywords

event set, web corpus, LSA-based event relatedness yield, better event set, web document, preliminary result, preliminary work, iterative system architecture, noise problem, present method, mining script-like structure
