Parallel mining of frequent patterns for school records analytics at the Universidad Michoacana

2017 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC) (2017)

Abstract
This paper presents research results on school records analytics, developed for the Universidad Michoacana (UMSNH) and based on a parallel implementation of data mining techniques. The core elements of this work were finding frequent patterns in the academic records of all UMSNH students from 2005 to 2016, and searching for relevant subsets of those frequent patterns using the distributed computing platform Spark. The FP-Growth algorithm used for finding frequent patterns is presented, together with serial, concurrent, and parallel implementations of the mining process based on it. Experimental results are discussed along two lines: (a) the superior performance of the parallel implementation compared to the serial and concurrent versions of the application, and (b) the advantages that mining at the frequent-pattern level offers for information retrieval on this specific problem, compared to mining at the level of association rules or correlation statistics.
Keywords
Data Mining, Big Data, FP-Growth, Parallel Computing
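
To make the notion of a frequent pattern concrete, the sketch below mines itemsets whose support count meets a minimum threshold. It is not the authors' code and not a full FP-Growth implementation: it follows the same pattern-growth idea (recursing over conditional databases) but omits the FP-tree compression and any Spark parallelism. The function name `frequent_patterns`, the `min_support` parameter, and the toy records are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def frequent_patterns(transactions, min_support):
    """Return {frozenset(items): support_count} for every itemset that
    appears in at least `min_support` transactions.

    Simplified pattern growth: no FP-tree, serial only. Items must be
    mutually comparable (e.g. all strings) because they are ordered.
    """
    results = {}

    def grow(prefix, projected):
        # Each transaction is a set, so an item counts once per transaction.
        counts = Counter(item for t in projected for item in t)
        for item in sorted(counts):
            support = counts[item]
            if support < min_support:
                continue
            pattern = prefix | {item}
            results[pattern] = support
            # Conditional database: transactions containing `item`,
            # restricted to items ordered after it, so that each itemset
            # is generated exactly once.
            conditional = [
                {other for other in t if other > item}
                for t in projected
                if item in t
            ]
            grow(pattern, conditional)

    grow(frozenset(), [set(t) for t in transactions])
    return results


# Hypothetical toy records, one list of outcomes per student (not real data).
records = [
    ["calculus:fail", "physics:fail", "dropout"],
    ["calculus:fail", "physics:fail"],
    ["calculus:pass", "physics:pass"],
    ["calculus:fail", "dropout"],
]

for itemset, count in sorted(
    frequent_patterns(records, min_support=2).items(),
    key=lambda kv: (-kv[1], sorted(kv[0])),
):
    print(sorted(itemset), count)
```

Mining stops at this level: the output is the set of frequent itemsets with their counts, with no association rules or correlation measures derived from them, which is the level of analysis the abstract argues for.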