Large-scale validation and analysis of interleaved search evaluation

Olivier Chapelle, Thorsten Joachims, Filip Radlinski, Yisong Yue

ACM Trans. Inf. Syst. (2012)

Abstract

Interleaving is an increasingly popular technique for evaluating information retrieval systems based on implicit user feedback. While a number of isolated studies have analyzed how this technique agrees with conventional offline evaluation approaches and other online techniques, a complete picture of its efficiency and effectiveness is still lacking. In this paper we extend and combine the body of empirical evidence regarding interleaving, and provide a comprehensive analysis of interleaving using data from two major commercial search engines and a retrieval system for scientific literature. In particular, we analyze the agreement of interleaving with manual relevance judgments and observational implicit feedback measures, estimate the statistical efficiency of interleaving, and explore the relative performance of different interleaving variants. We also show how to learn improved credit-assignment functions for clicks that further increase the sensitivity of interleaving.
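
For readers unfamiliar with the technique, the following is a minimal Python sketch of one commonly described interleaving variant, team-draft interleaving, paired with a simple rule that gives one credit per click. The abstract does not say which variants or credit-assignment functions are studied, so the function names (`team_draft_interleave`, `score_clicks`), the document IDs, and the scoring rule are illustrative assumptions, not the authors' implementation.

```python
import random


def team_draft_interleave(ranking_a, ranking_b, rng=None):
    """Merge two rankings into one list, remembering which ranker
    ("team") contributed each document. Assumes each input ranking
    contains no duplicate document IDs."""
    rng = rng or random.Random()
    interleaved = []
    team_of = {}            # document ID -> "A" or "B"
    count_a = count_b = 0
    all_docs = set(ranking_a) | set(ranking_b)

    while len(interleaved) < len(all_docs):
        # The team with fewer picks goes next; ties are broken by coin flip.
        a_turn = count_a < count_b or (count_a == count_b and rng.random() < 0.5)
        source, team = (ranking_a, "A") if a_turn else (ranking_b, "B")
        pick = next((d for d in source if d not in team_of), None)
        if pick is None:
            # This ranker has nothing new to offer; let the other one pick.
            source, team = (ranking_b, "B") if a_turn else (ranking_a, "A")
            pick = next(d for d in source if d not in team_of)
        interleaved.append(pick)
        team_of[pick] = team
        if team == "A":
            count_a += 1
        else:
            count_b += 1
    return interleaved, team_of


def score_clicks(team_of, clicked_docs):
    """Hypothetical credit rule: each click is one vote for the team that
    contributed the clicked document. Returns 1 if A wins, -1 if B wins,
    0 for a tie."""
    a = sum(1 for d in clicked_docs if team_of.get(d) == "A")
    b = sum(1 for d in clicked_docs if team_of.get(d) == "B")
    return (a > b) - (a < b)


if __name__ == "__main__":
    merged, teams = team_draft_interleave(
        ["d1", "d2", "d3"], ["d2", "d4", "d1"], rng=random.Random(0)
    )
    print(merged, teams, score_clicks(teams, ["d3"]))
```

Aggregating the per-query outcomes over many queries gives a preference between the two rankers. A learned credit-assignment function, as mentioned in the abstract, would replace the uniform one-vote-per-click rule in `score_clicks` with click-dependent weights.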

Keywords

statistical efficiency, popular technique, information retrieval system, implicit user feedback, different interleaving variant, large-scale validation, comprehensive analysis, observational implicit feedback measure, interleaved search evaluation, online technique, complete picture, retrieval system, sensitivity, empirical evidence, search engine, interleaving
