Approach to determine Frequent Items Dynamically

msra(2006)

引用 23|浏览12
暂无评分
摘要
It proposed a new method for dynamically determining the frequent items at any time in a relation which is undergoing deletion operations as well as inserts. Our methods maintain small space data structures that monitor the transactions on the relation, and, when required, quickly output all frequent items without rescanning the relation in the database. With user-specified probability, all frequent items are correctly reported. The methods rely on ideas from "group testing." They are simple to implement, and have provable quality, space, and time guarantees. Previously known algorithms for this problem that make similar quality and performance guarantees cannot handle deletions, and those that handle deletions cannot make similar guarantees without rescanning the database. Our experiments with real and synthetic data show that our algorithms are accurate in dynamically tracking the frequent items independent of the rate of insertions and deletion
更多
查看译文
关键词
data mining..,frequent items,synthetic data,data mining,data structure,group testing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要