FRIEND: Feature Selection on Inconsistent Data

Zhixin Qi,Hongzhi Wang,Tao He,Jianzhong Li,Hong Gao

Neurocomputing（2020）

引用 4|浏览116

暂无评分

摘要

With the explosive growth of information, inconsistent data are increasingly common. However, traditional feature selection methods are lack of efficiency due to inconsistent data repairing beforehand. Therefore, it is necessary to take inconsistencies into consideration during feature selection to not only reduce time costs but also guarantee accuracy of machine learning models. To achieve this goal, we present FRIEND, a feature selection approach on inconsistent data. Since features in consistency rules have higher correlation with each other, we aim to select a specific amount of features from these. We prove that the specific feature selection problem is NP-hard and develop an approximation algorithm for this problem. Extensive experimental results demonstrate the efficiency and effectiveness of our proposed approach.

查看译文

关键词

Feature selection,Inconsistent data,Mutual information,Data quality,Approximation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要