Modeling Randomized Data Streams In Caching, Data Processing, And Crawling Applications

2015 IEEE Conference on Computer Communications (INFOCOM)

Abstract
Many Big Data applications (e.g., MapReduce, web caching, search in large graphs) process streams of random key-value records that follow highly skewed frequency distributions. In this work, we first develop stochastic models for the probability of encountering unique keys during exploration of such streams and for the growth rate of the number of unique keys over time. We then apply these models to the analysis of LRU caching, MapReduce overhead, and various crawl properties (e.g., node-degree bias, frontier size) in random graphs.
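
To make the notion of "unique keys in a skewed stream" concrete, the following is a minimal sketch and not the paper's model. It assumes keys are drawn independently from a fixed Zipf-like distribution; the function names, the IID assumption, and the parameter values are illustrative choices that do not appear in the abstract. Under that assumption, the expected number of unique keys after n records is the sum over keys of 1 - (1 - p_i)^n, and the probability that the next record carries a previously unseen key is the sum over keys of p_i (1 - p_i)^n.

```python
import random


def zipf_probs(num_keys, alpha):
    """Normalized Zipf-like probabilities p_i proportional to 1 / i**alpha (assumed skew)."""
    weights = [1.0 / (rank ** alpha) for rank in range(1, num_keys + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def expected_unique(probs, n):
    """Expected number of distinct keys among n IID draws: sum_i (1 - (1 - p_i)**n)."""
    return sum(1.0 - (1.0 - p) ** n for p in probs)


def prob_next_is_new(probs, n):
    """Probability that draw n+1 is a key not seen in the first n draws."""
    return sum(p * (1.0 - p) ** n for p in probs)


def simulate_unique(probs, n, rng):
    """Empirical count of distinct keys in one simulated stream of length n."""
    keys = rng.choices(range(len(probs)), weights=probs, k=n)
    return len(set(keys))


if __name__ == "__main__":
    rng = random.Random(0)              # fixed seed for repeatability
    probs = zipf_probs(num_keys=1000, alpha=1.0)
    for n in (10, 100, 1000):
        runs = [simulate_unique(probs, n, rng) for _ in range(100)]
        print(n, expected_unique(probs, n), sum(runs) / len(runs),
              prob_next_is_new(probs, n))
```

The printed columns are the stream length, the analytical expectation, the simulated average, and the probability that the next key is new. With a skewed distribution the unique-key count grows sublinearly in n, which is the kind of growth curve the abstract refers to.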
Keywords
randomized data streams, caching application, data processing, crawling application, Big Data applications, frequency distribution, stochastic model, probability, LRU caching, MapReduce overhead, crawl properties, random graphs