Detecting Parser Errors Using Web-based Semantic Filters.

EMNLP '06: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing(2006)

引用 21|浏览44
暂无评分
摘要
NLP systems for tasks such as question answering and information extraction typically rely on statistical parsers. But the efficacy of such parsers can be surprisingly low, particularly for sentences drawn from heterogeneous corpora such as the Web. We have observed that incorrect parses often result in wildly implausible semantic interpretations of sentences, which can be detected automatically using semantic information obtained from the Web. Based on this observation, we introduce Web-based semantic filtering ---a novel, domain-independent method for automatically detecting and discarding incorrect parses. We measure the effectiveness of our filtering system, called Woodward, on two test collections. On a set of TREC questions, it reduces error by 67%. On a set of more complex Penn Treebank sentences, the reduction in error rate was 20%.
更多
查看译文
关键词
incorrect parses,Web-based semantic,implausible semantic interpretation,semantic information,error rate,information extraction,statistical parsers,NLP system,TREC question,complex Penn Treebank sentence,parser error,web-based semantic filter
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要