Value-Aware Resampling And Loss For Imbalanced Classification

PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018)(2018)

引用 3|浏览31
暂无评分
摘要
Existing(1) machine learning methods usually treat training samples equally, and their performance degrades significantly when facing imbalanced training data. This paper introduces Value-Aware Resampling and Loss (VARL) to tackle the imbalanced classification problem, where high-value samples play a more important role than those of low-value samples in the model training process. Specifically, the training value of each training sample is assessed according to its predicted probability of ground truth label, and then training samples are resampled to produce a balanced training set, at last the model training is further boosted by using an instance-level value-aware loss function. To conduct fair comparisons among different methods, we compile 13 datasets for imbalanced classification. Experiments demonstrate that our proposed method can effectively measure the training value of training samples, and achieve superior performance in imbalanced classification compared with several existing methods.
更多
查看译文
关键词
Imbalanced classification, data resampling, cost-sensitive
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要