FEVER: a large-scale dataset for Fact Extraction and VERification

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit Mittal

arXiv (Cornell University) (2018)


Abstract

In this paper we introduce a new publicly available dataset for verification against textual sources, FEVER: Fact Extraction and VERification. It consists of 185,445 claims generated by altering sentences extracted from Wikipedia and subsequently verified without knowledge of the sentence they were derived from. The claims are classified as Supported, Refuted or NotEnoughInfo by annotators achieving 0.6841 in Fleiss κ. For the first two classes, the annotators also recorded the sentence(s) forming the necessary evidence for their judgment. To characterize the challenge of the dataset presented, we develop a pipeline approach and compare it to suitably designed oracles. The best accuracy we achieve on labeling a claim accompanied by the correct evidence is 31.87%. FEVER is a challenging testbed that will help stimulate progress on claim verification against textual sources.
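As a rough illustration of the agreement statistic quoted in the abstract, the sketch below computes Fleiss κ from a table of per-item category counts. The function name `fleiss_kappa` and the toy counts are assumptions made for this example; they are not the FEVER annotations or the authors' code, and the sketch assumes every item was rated by the same number of annotators.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table where counts[i][j] is the number of
    annotators who assigned item i to category j.

    Assumes every item has the same number of ratings (at least 2).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every item must have the same number of ratings")

    # Observed agreement: mean over items of the fraction of agreeing rater pairs.
    per_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement: sum of squared overall category proportions.
    n_categories = len(counts[0])
    totals = [sum(row[j] for row in counts) for j in range(n_categories)]
    p_chance = sum((t / (n_items * n_raters)) ** 2 for t in totals)

    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical toy data: 4 claims, 5 annotators each,
# columns = (Supported, Refuted, NotEnoughInfo).
toy_counts = [
    [5, 0, 0],
    [0, 4, 1],
    [1, 0, 4],
    [3, 2, 0],
]
print(round(fleiss_kappa(toy_counts), 4))
```

A value of 1 indicates perfect agreement, while values near 0 indicate agreement no better than chance.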


Keywords

fact extraction, FEVER, dataset, large-scale
