Imbalance example-dependent cost classification: A Bayesian based method

EXPERT SYSTEMS WITH APPLICATIONS(2023)

引用 1|浏览18
暂无评分
摘要
Example-dependent cost classification is a special case of pattern classification where the costs are specific for each individual pattern. Most of the practical applications related to this kind of classification problem exhibit class imbalance in the available data, thus including an additional difficulty to the classification task. This problem has high practical importance because it appears intrinsically in relevant application fields, such as Finance or Health. We propose to use a 2-step Bayesian methodology to solve this problem because its formulation allows the inclusion of the individual example costs in the classification and takes into account the class probabilities. In particular, the main contribution is to apply principled rebalancing classification algorithms in the first step: We propose 3 Neural Network based learning machines, WR-MLP, WSR-MLPE and WSR-DNN, to provide the estimates of the required conditional probabilities for the Bayesian test. Unlike some similar approaches in the literature that use heuristic methods in the first step, which in most cases require calibration mechanisms to compensate for the estimation biases, the consistency of the proposed estimates is theoretically supported, thus providing a clear potential advantage. Experiments with seven real-world datasets show that the proposed methods are competitive against eleven state-of-the-art benchmarks, and provide an advantage in the less favourable situations: cases with a strong imbalance and highly nonlinear classification borders.
更多
查看译文
关键词
Bregman divergences, Classification, Example-dependent cost, Imbalanced data, Neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要