Class‐specific variable selection in high‐dimensional discriminant analysis through Bayesian Sparsity

Journal of Chemometrics (2019)

Abstract
Although the ongoing digital revolution in fields such as chemometrics, genomics, and personalized medicine gives hope for considerable progress in these areas, it also produces ever more high-dimensional data to analyze and interpret. A common task in these fields is discriminant analysis, which may, however, suffer from the high dimensionality of the data. Recent advances, through subspace classification or variable selection methods, have made it possible to achieve either excellent classification performance or useful visualizations and interpretations. It is of great interest to have both excellent classification accuracy and a meaningful variable selection for interpretation. This work addresses this issue by introducing a subspace discriminant analysis method that performs class-specific variable selection through Bayesian sparsity. The resulting classification methodology is called sparse high-dimensional discriminant analysis (sHDDA). Contrary to most sparse methods, which are based on the Lasso, sHDDA relies on Bayesian modeling of the sparsity pattern and avoids the painstaking and sensitive cross-validation of the sparsity level. The main features of sHDDA are illustrated on simulated and real-world data. In particular, an exemplar application to cancer characterization based on medical imaging using radiomic feature extraction is proposed.
Keywords
Bayesian sparsity, discriminant analysis, high-dimensional data, variable selection