Semi-Supervised Feature Selection via Sparse Rescaled Linear Square Regression

IEEE Transactions on Knowledge and Data Engineering (2020)

Abstract
With the rapid growth of data size, there is increasing demand for feature selection methods that exploit both labeled and unlabeled data. In this paper, we propose a novel semi-supervised embedded feature selection method. The new method extends the least square regression model by rescaling the regression coefficients with a set of scale factors, which are used to evaluate the importance of features. An iterative algorithm is proposed to optimize the new model. It is proved that solving the new model is equivalent to solving a sparse model with a flexible and adaptable $\ell_{2,p}$ norm regularization. Moreover, the optimal solution of the scale factors provides a theoretical explanation for why we can use $\lbrace \Vert \mathbf{w}^{1} \Vert_{2}, \ldots, \Vert \mathbf{w}^{d} \Vert_{2} \rbrace$ to evaluate the importance of features. Experimental results on eight benchmark data sets show the superior performance of the proposed method.
Keywords
Feature extraction,Computational complexity,Laplace equations,Knowledge discovery,Data engineering,Iterative methods,Adaptation models
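The abstract's idea of scoring features by the row norms $\Vert \mathbf{w}^{i} \Vert_{2}$ of a regression coefficient matrix can be illustrated with a minimal sketch. This is not the paper's method: it assumes a plain supervised ridge regression in place of the semi-supervised rescaled model, and the function names, the `lam` parameter, and the centering step are hypothetical choices made for illustration. The second function uses the standard definition of the $\ell_{2,p}$ norm, $(\sum_i \Vert \mathbf{w}^{i} \Vert_2^p)^{1/p}$.

```python
import numpy as np


def row_norm_feature_scores(X, Y, lam=1.0):
    """Score features by the l2 norm of each row of W.

    Assumption: W comes from ordinary ridge regression on centered data,
    a stand-in for the paper's rescaled semi-supervised model.
    X has shape (n, d); Y has shape (n, c), e.g. one-hot labels.
    """
    Xc = X - X.mean(axis=0)  # center so no intercept term is needed
    Yc = Y - Y.mean(axis=0)
    d = Xc.shape[1]
    # Closed-form ridge solution: W = (X^T X + lam * I)^{-1} X^T Y
    W = np.linalg.solve(Xc.T @ Xc + lam * np.eye(d), Xc.T @ Yc)
    # One score per feature: the l2 norm of the corresponding row of W
    return np.linalg.norm(W, axis=1)


def l2p_norm(W, p):
    """Standard l_{2,p} norm: (sum_i ||w^i||_2^p)^(1/p) over the rows of W."""
    row_norms = np.linalg.norm(W, axis=1)
    return np.sum(row_norms ** p) ** (1.0 / p)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((200, 6))
    labels = (X[:, 0] + X[:, 1] > 0).astype(int)  # only features 0 and 1 matter
    Y = np.eye(2)[labels]                         # one-hot label matrix
    scores = row_norm_feature_scores(X, Y)
    print("features ranked by score:", np.argsort(-scores))
```

Sorting the scores in descending order gives a feature ranking; in the synthetic example only the first two features determine the label, so they should receive the largest row norms.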