An approach for building lexical-semantic resources based on heterogeneous information sources

José Gildo de A. Júnior,Ulrich Schiel,Leandro Balby Marinho

SAC 2015: Symposium on Applied Computing Salamanca Spain April, 2015(2015)

引用 1|浏览12
暂无评分
摘要
Lexical-semantic resources (LSRs) have an important role in many information retrieval and extraction tasks. However, in order to be effective, LSRs need to cover a broad spectrum of knowledge facets about terms, e.g., encyclopedic, linguistic, and common sense. These different knowledge facets are usually found in different and heterogeneous knowledge sources spread over the Web, turning the task of integrating them into an unified LSR a hard one. In this work, we propose a new approach to automatically build LSRs that are tailored to semantic search engines, i.e., our approach builds LSRs that favor disambiguation and faceted search. Moreover, while most of the related work is limited to using encyclopedic sources only, such as Wikipedia, we instantiate our approach using additional and heterogeneous knowledge sources, such as dictionaries (WordNet), common sense (Open Mind Common Sense), and semantic networks (Conceptnet5). For evaluation, we compare our approach with UBY, a open source and state-of-the-art LSR, in terms of the number of terms covered; the degree of semantic connections established for each term; and the strength of the extracted semantic relations between terms. We conducted experiments in several test sets and show that our approach is superior under the aforementioned aspects.
更多
查看译文
关键词
Knowledge Integration, Lexical-Semantic Resource
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要