WordNet Ontology Based Model for Web Retrieval

Tokyo(2005)

引用 70|浏览16
暂无评分
摘要
It is well known that ontologies will become a key piece, as they allow making the semantics of Semantic Web content explicit. In spite of the big advantages that the Semantic Web promises, there are still several problems to solve. Those concerning ontologies include their availability, development, and evolution. In the area of information retrieval, the dimension of document vectors plays an important role. Firstly, with higher index dimensions the indexing structures suffer from the "curse of dimensionality" and their efficiency rapidly decreases. Secondly, we may not use exact words when looking for a document, thus we miss some relevant documents. LSI is a numerical method, which discovers latent semantics in documents by creating concepts from existing terms. In this paper we present a basic method of mapping LSI concepts on given ontology (Word- Net), used both for retrieval recall improvement and dimension reduction.We offer experimental results for this method on a subset of TREC collection, consisting of Los Angeles Times articles.
更多
查看译文
关键词
web retrieval,higher index dimension,lsi concept,dimension reduction,numerical method,relevant document,information retrieval,document vector,wordnet ontology,latent semantics,basic method,semantic web,ontologies,indexation,indexing terms,multidimensional systems,indexing,software engineering,curse of dimensionality,computer science
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要