Dealing with web data: history and look ahead

PVLDB(2010)

引用 2|浏览27
暂无评分
摘要
The high rate of change and the unprecedented scale of the Web pose enormous challenges to search engines who wish to provide the most up-to-date and highly relevant information to its users. The VLDB 2000 paper "The Evolution of the Web and Implications for an Incremental Crawler" tried to address part of this challenge by collecting and analyzing the Web history data and by describing the architecture and the associated algorithms for an incremental Web crawler that can provide more up-to-date data to users in a timely manner. Experiments and theoretical analysis showed --- surprisingly at the time --- that a policy that allocates more resources to more frequently changing items does not necessarily lead to better performance. In this paper, we discuss what has happened in the 10 years since and talk about the challenges that lie head.
更多
查看译文
关键词
high rate,incremental crawler,web data,associated algorithm,incremental web crawler,relevant information,enormous challenge,web history data,theoretical analysis,better performance,up-to-date data,look ahead
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要