An Efficient Algorithm for Incremental Update of Concept Spaces
PAKDD(2002)
摘要
The vocabulary problem in information retrieval arises because authors and indexers often use different terms for the same concept. A thesaurus defines mappings between different but related terms. It is widely used in modern information retrieval systems to solve the vocabulary problem. Chen et al. proposed the concept space approach to automatic thesaurus construction.A concept space contains the associations between every pair of terms. Prev ious research studies show that concept space is a useful tool for helping information searchers in revising their queries in order to get better results from information retrieval systems. The construction of a concept space, however, is very computationally intensive. In this paper, we propose and evaluate an efficient algorithm for the incremental update of concept spaces. In our model, only strong associations are maintained, since they are most useful in thesauri construction. Our algorithm uses a pruning technique to avoid computing weak associations to achieve efficiency.
更多查看译文
关键词
different term,efficient algorithm,incremental update,information retrieval system,concept spaces,concept space,thesauri construction,vocabulary problem,information retrieval,information searcher,modern information retrieval system,automatic thesaurus construction,concept space approach,text mining,image features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络