Pseudo-Relevance Feedback Based On Mrmr Criteria
INFORMATION RETRIEVAL TECHNOLOGY(2010)
摘要
Pseudo-relevance feedback has shown to be an effective method in many information retrieval tasks. Various criteria have been proposed to rank terms extracted from the top ranked document of the initial retrieval results. However, most existing methods extract terms individually and do not consider the impacts of relationships among terms and their combinations. In this study, we first re-examine this assumption and show that combinations of terms may heavily impact the final results. We then present a novel clustering based method to select expansion terms as a whole set. The main idea is to use first simultaneously cluster terms and documents using non-negative matrix factorization, and then use the Maximum Relevance and Minimum Redundancy criteria to select terms based on their clusters, term distributions, and other features. Experimental results on serval TREC collections show that our proposed method significantly improves performances.
更多查看译文
关键词
Pseudo-relevance Feedback,NMF,mRMR Criteria
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络