Big data classification using heterogeneous ensemble classifiers in Apache Spark based on MapReduce paradigm
Expert Systems with Applications(2021)
摘要
•Distributed Heterogeneous Ensemble is designed for big data classification.•Classifiers are pruned from the ensemble to increase the diversity.•A Spark version of DHBoost is presented based on MapReduce programming paradigm.•DHBoost outperforms the state-of-the-art ensemble classifiers in the Spark library.
更多查看译文
关键词
Ensemble classifier,Boosting,MapReduce,Big data,Apache Spark,Apache Hadoop
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络