ImbTreeEntropy: An R package for building entropy-based classification trees on imbalanced datasets

SoftwareX(2021)

引用 2|浏览3
暂无评分
摘要
In this paper, we propose a novel R package, named ImbTreeEntropy, for building binary and multiclass decision trees using generalized entropy functions, such as Rènyi, Tsallis, Sharma–Mittal, Sharma–Taneja and Kapur, to measure the impurity of a node. These are important extensions of the existing algorithms that usually employ Shannon entropy and the concept of information gain. Additionally, ImbTreeEntropy is able to handle imbalanced data, which is a challenging issue in many practical applications. The package supports cost-sensitive learning by defining a misclassification cost matrix and weighted sensitive learning. It accepts all types of attributes, including continuous, ordered and nominal attributes. The package and its code are made freely available.
更多
查看译文
关键词
Decision trees,Generalized entropy,Cost-sensitive learning,Imbalanced data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要