TabEL: Entity Linking in Web Tables.

International Semantic Web Conference(2015)

引用 206|浏览105
暂无评分
摘要
Web tables form a valuable source of relational data. The Web contains an estimated 154 million HTML tables of relational data, with Wikipedia alone containing 1.6 million high-quality tables. Extracting the semantics of Web tables to produce machine-understandable knowledge has become an active area of research. A key step in extracting the semantics of Web content is entity linking EL: the task of mapping a phrase in text to its referent entity in a knowledge base KB. In this paper we present TabEL, a new EL system for Web tables. TabEL differs from previous work by weakening the assumption that the semantics of a table can be mapped to pre-defined types and relations found in the target KB. Instead, TabEL enforces soft constraints in the form of a graphical model that assigns higher likelihood to sets of entities that tend to co-occur in Wikipedia documents and tables. In experiments, TabEL significantly reduces error when compared to current state-of-the-art table EL systems, including a $$75\\%$$ error reduction on Wikipedia tables and a $$60\\%$$ error reduction on Web tables. We also make our parsed Wikipedia table corpus and test datasets publicly available for future work.
更多
查看译文
关键词
Web tables,Entity linking,Named entity disambiguation,Graphical models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要