Hit Count Reliability: How Much Can We Trust Hit Counts?

Koh Satoh,Hayato Yamana

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications(2012)

引用 13|浏览0
暂无评分
摘要
Recently, there have been numerous studies that rely on the number of search results, i.e., hit count. However, hit counts returned by search engines can vary unnaturally when observed on different days, and may contain large errors that affect researches that depend on those results. Such errors can result in low precision of machine translation, incorrect extraction of synonyms and other problems. Thus, it is indispensable to evaluate and to improve the reliability of hit counts. There exist several researches to show the phenomenon; however, none of previous researches have made clear how much we can trust them. In this paper, we propose hit counts’ reliability metrics to quantitatively evaluate hit counts’ reliability to improve hit count selection. The evaluation results with Google show that our metrics successfully adopt reliable hit counts – 99.8% precision, and skip to adopt unreliable hit counts – 74.3% precision.
更多
查看译文
关键词
Information Retrieval,Search Engine,Hit Count,Reliability
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要