An unsupervised feature selection approach for actionable warning identification

Xiuting Ge,Chunrong Fang,Jia Liu,Mingshuang Qing,Xuanye Li,Zhihong Zhao

Expert Syst. Appl.（2023）

引用 1|浏览12

暂无评分

摘要

Static Analysis Tools (SATs) are widely applied to detect defects in software projects. However, SATs are overshadowed by a large number of unactionable warnings, which severely hinder the usability of SATs. To address this problem, the existing approaches commonly use Machine Learning (ML) techniques for Actionable Warning Identification (AWI). For these ML-based AWI approaches, the warning feature determination is one of the most critical parts to effectively identify actionable warnings. To eliminate redundant and irrelevant warning features, ML-based AWI approaches usually incorporate feature selection to determine the feature subset by calculating the importance or correlation of features with warning labels. Nevertheless, warning labels are not always available directly in practice. Thus, it is vital and challenging to select warning features for ML-based AWI approaches when warning labels are absent.To address the above problem, we propose an UNsupervised fEAture SElection approach called UNEASE for ML-based AWI. (1) UNEASE first performs the feature clustering to gather warning features into clusters, where the number of clusters is automatically determined and features in the same cluster are considered redundant. (2) Subsequently, UNEASE performs the feature ranking to sort warning features in each cluster with three newly proposed ranking strategies and selects the top-ranked warning feature from each cluster. Based on the selected features, we train a ML classifier to identify actionable warnings. We conduct experiments in eight large-scale and real-world warning datasets. Comparing UNEASE with nine typical feature reduction techniques, the experimental results show that while taking the low cost to perform the feature selection and maintaining the low redundancy rate in the selected warning features, UNEASE obtains the top-ranked AUC.

查看译文

关键词

Unsupervised feature selection,Actionable warning identification,Machine learning,Static analysis warnings

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要