A novel hybrid variable selection strategy with application to molecular spectroscopic analysis

Chemometrics and Intelligent Laboratory Systems(2023)

引用 0|浏览2
暂无评分
摘要
In chemometrics, partial least squares (PLS) regression has become an established tool for spectroscopic analysis. Even so, to improve multivariate calibration models’ performance, variable selection is necessary for molecular spectroscopic analysis in many scenarios. However, applying only one variable selection method has many limitations. For example, individual wavelength selection methods may suffer from large computational requirements. In addition, wavelength intervals selection methods may suffer from the retention of uninformative/interfering wavelengths and the effect of collinearity in the selected wavelength intervals. Accordingly, a novel hybrid variable selection strategy, called memetic algorithm-interval partial least squares coupled Hilbert-Schmidt independence criterion based variable space iterative optimization (MA-iPLS + HSIC–VSIO), is proposed in this study. In the first step, a wavelength intervals selection method, MA-iPLS, is used to select optimal wavelength intervals. In the second step, a novel individual wavelength selection method, HSIC–VSIO, is employed for further optimizing the selected wavelength intervals. This hybrid variable selection strategy makes full use of MA-iPLS and HSIC–VSIO. To investigate the performance of MA-iPLS + HSIC–VSIO, it was tested on two groups of spectroscopic datasets: the surface-enhanced Raman scattering (SERS) spectra of chlorpyrifos standard solutions dataset and the near infrared (NIR) spectra of diesel fuels dataset. Nine methods, including PLS, HSIC–VISO, MA-iPLS, genetic algorithm-interval partial least squares (GA-iPLS), particle swarm optimization-interval partial least squares (PSO-iPLS), interval random frog (iRF), GA-iPLS + HSIC–VSIO, PSO-iPLS + HSIC–VSIO, and iRF + HSIC–VSIO, were also applied on the spectroscopic dataset for comparison. The results demonstrated the excellent performance of MA-iPLS + HSIC–VSIO for molecular spectroscopic analysis.
更多
查看译文
关键词
Hybrid variable selection strategy,Molecular spectroscopy,Memetic algorithm,Hilbert-schmidt independence criterion,Variable space iterative optimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要