Novelty and redundancy detection in adaptive filtering.
IR(2002)
摘要
ABSTRACTThis paper addresses the problem of extending an adaptive information filtering system to make decisions about the novelty and redundancy of relevant documents. It argues that relevance and redundance should each be modelled explicitly and separately. A set of five redundancy measures are proposed and evaluated in experiments with and without redundancy thresholds. The experimental results demonstrate that the cosine similarity metric and a redundancy measure based on a mixture of language models are both effective for identifying redundant documents.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络