USER: Towards High-Utility Sequential Rules with Repetitive Items.

2023 IEEE International Conference on Big Data (BigData)(2023)

Abstract
Discovering interesting sequential rules in sequence databases is important in a variety of fields, ranging from customer behavior analysis to intrusion detection. High utility sequential rule mining (HUSRM) was proposed to obtain more informative rules. Its goal is to find sequential rules with both high utility and high confidence, i.e., HUSRs. To the best of our knowledge, only a few algorithms have been proposed to discover HUSRs, and these algorithms do not fully account for repetitive items in the sequences of the database. In this paper, we propose an algorithm named USER to discover HUSRs in multi-sequences that contain repetitive items. A data structure called an occurrence information (OI)-list is designed to distinguish the different occurrences of an item in a sequence. Moreover, we discuss in detail how the upper bound value changes after a rule is expanded, an analysis that repetitive items make more complicated. We also introduce two pruning strategies (ROOR and REIO-I) to improve mining efficiency when a sequence contains many repetitive items. Finally, we conduct experiments on several datasets, and the results show that USER discovers HUSRs with more accurate utility values while keeping running time and memory consumption acceptable.
Keywords
sequential rule, utility mining, repetitive item, OI-list