Approach to determine Frequent Items Dynamically

HONG-ZHEN ZHENG,DIAN-HUI CHU,DE-CHEN ZHAN

msra（2006）

引用 23|浏览12

暂无评分

摘要

It proposed a new method for dynamically determining the frequent items at any time in a relation which is undergoing deletion operations as well as inserts. Our methods maintain small space data structures that monitor the transactions on the relation, and, when required, quickly output all frequent items without rescanning the relation in the database. With user-specified probability, all frequent items are correctly reported. The methods rely on ideas from "group testing." They are simple to implement, and have provable quality, space, and time guarantees. Previously known algorithms for this problem that make similar quality and performance guarantees cannot handle deletions, and those that handle deletions cannot make similar guarantees without rescanning the database. Our experiments with real and synthetic data show that our algorithms are accurate in dynamically tracking the frequent items independent of the rate of insertions and deletion

查看译文

关键词

data mining..,frequent items,synthetic data,data mining,data structure,group testing

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要