Efficiently Mining Closed Subsequences with Gap Constraints

SDM(2008)

引用 79|浏览22
暂无评分
摘要
Mining frequent subsequence patterns from sequence databases is a typical data mining problem and various e-cient sequential pattern mining algorithms have been proposed. In many problem domains (e.g, biology), the frequent subsequences conflned by the predeflned gap requirements are more meaningful than the general sequential patterns. In this paper we re-examine the closed sequential pattern mining problem by introduc- ing the gap constraints. The most challenging parts in this task include the constrained pattern closure check- ing and unpromising search space pruning. Inspired by some state-of-the-art closed or constrained sequential pattern mining algorithms, we propose an e-cient ap- proach to flnding the complete set of closed sequential patterns with gap constraints. The approach combines the newly devised constrained pattern closure checking scheme and pruning techniques with the pattern growth based subsequence enumeration framework. Our exten- sive performance study shows that our approach is very e-cient in mining frequent closed subsequences with gap constraints.
更多
查看译文
关键词
gap-constraint.,. constrained subsequence mining,frequent closed subsequence,sequential pattern mining,data mining,search space
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要