Weiting Wanga,
Yonghuan Yun*a,
Baichuan Dengab,
Wei Fanc and
Yizeng Liang*a
aCollege of Chemistry and Chemical Engineering, Central South University, Changsha 410083, P. R. China. E-mail: yizeng_liang@263.net; yunyonghuan@foxmail.com; Fax: +86-731-88830831; Tel: +86-731-88830824
bDepartment of Chemistry, University of Bergen, Bergen N-5007, Norway
cJoint Lab for Biological Quality and Safety, College of Bioscience and Biotechnology, Hunan Agriculture University, Changsha 410128, P. R. China
First published on 27th October 2015
Based on the theory that a large partial least squares (PLS) regression coefficient on autoscaled data indicates an important variable, a novel strategy for variable selection called iteratively variable subset optimization (IVSO) is proposed in this study. In addition, we take into consideration that the optimal number of latent variables generated by cross-validation can make a great difference to the regression coefficients, sometimes by several orders of magnitude. In this work, the regression coefficients generated in every sub-model are normalized to remove this influence. In each iterative round, the regression coefficients of each variable obtained from the sub-models are summed to evaluate its importance level. A two-step procedure including weighted binary matrix sampling (WBMS) and sequential addition is employed to eliminate uninformative variables gradually and gently in a competitive way and to reduce the risk of losing important variables, so that IVSO can achieve high stability. Investigated on one simulated dataset and two NIR datasets, IVSO shows much better prediction ability than two other outstanding and commonly used methods, Monte Carlo uninformative variable elimination (MC-UVE) and competitive adaptive reweighted sampling (CARS). The MATLAB code for implementing IVSO is available in the ESI.
At present, many variable selection methods have been employed for multi-component spectral data. In general, these methods can be classified into two categories: static and dynamic approaches. The static approaches use one criterion for the whole data space, while the dynamic approaches take the result of each iteration into account. The static approaches include t-statistics, the Akaike information criterion (AIC), uninformative variable elimination (UVE),8 Monte Carlo based uninformative variable elimination (MC-UVE),9,10 variable importance in projection (VIP),11 selectivity ratio (SR),12 and moving window partial least squares (MWPLS).13 The dynamic approaches include optimization-algorithm-based methods such as the genetic algorithm (GA),14–16 particle swarm optimization (PSO),17 the firefly algorithm,18 ant colony optimization (ACO),19,20 the gravitational search algorithm (GSA),21 and simulated annealing (SA).22 Variable selection methods such as random forest,23 the successive projection algorithm (SPA),24 iteratively retaining informative variables (IRIV),25 variable combination population analysis (VCPA),26 competitive adaptive reweighted sampling (CARS),27 and interval random frog (iRF)28 are also dynamic approaches. UVE, MC-UVE, CARS, and iRF are based on the principle that the larger the absolute regression coefficient on the autoscaled data is, the more important the variable is.8,29 In addition to the regression coefficient, Kvalheim et al. discussed the use of SR, which can assist in improved algorithms for variable selection in latent variable regression models.30
Among all the methods based on regression coefficients, MC-UVE and CARS are adopted extensively in multivariate calibration models because of their better prediction performance. In both MC-UVE and CARS, the Monte Carlo sampling technique is applied to the sample space to establish a large number of sub-models, which ensures that the number of samples selected randomly for modeling is strictly the same; for example, 80% of all samples are used to build each model. For MC-UVE, after N Monte Carlo sampling runs, each variable is evaluated according to a criterion equal to the ratio of the mean of its regression coefficients to their standard deviation; the variables with small criteria are eliminated. Unlike MC-UVE, in each iterative round CARS removes the variables with small mean regression coefficients forcibly by an exponentially decreasing function (EDF) and competitively by adaptive reweighted sampling (ARS). However, in MC-UVE it is the full spectrum that is used to establish the sub-models, so the regression coefficients of the informative variables can be influenced by the uninformative variables. With regard to CARS, the enforced elimination of variables by the EDF may lose important variables and further result in instability. Hence, in most cases the results achieved by MC-UVE and CARS are not satisfactory enough.
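To make the MC-UVE criterion described above concrete, the following minimal sketch computes the ratio of the mean of each variable's regression coefficients to their standard deviation over repeated Monte Carlo runs. It is an illustrative Python rendering, not the original implementation; the function name, the use of scikit-learn's PLSRegression, and the fixed number of latent variables are assumptions made for the example.

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression

def mcuve_criterion(X, y, n_lv=6, n_runs=500, ratio=0.8, rng=None):
    """MC-UVE stability criterion: |mean(b_i)| / std(b_i) over n_runs sub-models."""
    rng = np.random.default_rng(rng)
    n, p = X.shape
    coefs = np.zeros((n_runs, p))
    for j in range(n_runs):
        # Monte Carlo sampling in the sample space, e.g. 80% of the samples per sub-model
        idx = rng.choice(n, size=int(ratio * n), replace=False)
        pls = PLSRegression(n_components=n_lv).fit(X[idx], y[idx])
        coefs[j] = np.ravel(pls.coef_)  # regression coefficient vector of sub-model j
    # variables with a small criterion are candidates for elimination
    return np.abs(coefs.mean(axis=0)) / coefs.std(axis=0)
```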
In this study, a novel strategy for variable selection based on regression coefficients, called iteratively variable subset optimization (IVSO), is proposed. First, we introduce a new random sampling method, named weighted binary matrix sampling (WBMS),31,32 which is an improvement of binary matrix sampling (BMS).25,33 By giving different weights to different variables, WBMS makes variables with larger weights more likely to be chosen; conversely, a variable with a small weight is selected with little or even no probability. Furthermore, by combining WBMS with another strategy called sequential addition, the variables with small criteria are deleted and a new variable subset is generated. After N WBMS runs, N different variable subsets are obtained, and the root mean square error of cross-validation (RMSECV) is used as the objective function to search for the best variable subset. In addition, the regression coefficients of each variable in all sub-models are summed to evaluate its importance level. This data fusion step is a good option, as the noise cancels out while the systematic information accumulates. However, we find that the optimal number of latent variables generated by cross-validation makes a great difference to the regression coefficients, which is consistent with the viewpoint in ref. 34. Thus, the regression coefficients of the same variable in different sub-models cannot be calculated or compared directly owing to this great difference, and a normalization strategy is applied to eliminate the influence. Tested on a simulated dataset and two NIR datasets, IVSO coupled with partial least squares (PLS) demonstrates better prediction ability and higher stability than the two outstanding methods mentioned above, MC-UVE and CARS. The results demonstrate that IVSO eliminates uninformative variables gradually and gently in a competitive way, which avoids the two problems of MC-UVE and CARS discussed above. This proves that IVSO is an efficient method for variable selection in multivariate calibration.
Additionally, it should be noted that although IVSO is evaluated only on NIR datasets with PLS in this study, it is a general strategy that can be combined with other regression and pattern recognition methods and applied to other kinds of datasets, such as metabolomics and quantitative structure–activity relationship (QSAR) data.
It works as follows: assume that the weight of the ith variable is wi. First, a binary matrix M with dimensionality N × P is generated, containing only '1' and '0'. A '1' indicates that the corresponding variable is included for modeling, while a '0' indicates that the variable is not sampled. In the ith column, there are Nwi entries of '1' and the remaining entries are '0'. The procedure is displayed in Fig. 1, where the number of rows of M is set to 5 and the number of columns to 7 for simplicity. During sampling, the weights of some variables may be too small for them to be sampled at all; the first and second columns in Fig. 1 represent this case. As the last column shows, if the weight of a variable is equal to 1, it will be sampled in every run. Next, each column of M is permuted to generate a new binary matrix NM. Remarkably, after the permutation, the number of '1's and '0's in each column remains unchanged.
In the matrix NM, each row represents one sampling process for building a sub-model. It follows that when Nwi of a variable is less than 1, the variable is eliminated.
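As a concrete illustration, the following minimal sketch generates a WBMS matrix in the way just described: column i receives round(N·wi) ones and each column is then permuted independently. This is a simplified Python rendering (the authors' MATLAB code is available in the ESI), and the function and variable names are illustrative.

```python
import numpy as np

def wbms(weights, n_runs, rng=None):
    """Weighted binary matrix sampling: column i holds round(N * w_i) ones, randomly placed."""
    rng = np.random.default_rng(rng)
    p = len(weights)
    nm = np.zeros((n_runs, p), dtype=int)
    for i, w in enumerate(weights):
        n_ones = int(round(n_runs * w))     # a variable with N*w_i < 1 is never sampled
        col = np.zeros(n_runs, dtype=int)
        col[:n_ones] = 1
        nm[:, i] = rng.permutation(col)     # permuting keeps the number of ones unchanged
    return nm                               # each row: one sampling run (1 = variable included)
```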
T = XW    (1)

y = Tc + e = XWc + e = Xb + e    (2)
In addition, the matrix X needs to be autoscaled to guarantee that each variable has the same variance before modeling. It should be noted that the regression coefficients mentioned in this study are converted into absolute values before the calculation. Thereafter, the larger the absolute regression coefficient is, the more important the variable is.
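For clarity, the sketch below shows one way to obtain the absolute PLS regression coefficients on autoscaled data, i.e. the vector b of eqn (2) taken in absolute value. The use of scikit-learn's PLSRegression and the helper name are assumptions made for this illustration, not the authors' implementation.

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression

def abs_pls_coefficients(X, y, n_lv):
    """Absolute PLS regression coefficients of y = Xb + e, computed on autoscaled data."""
    # autoscaling: every variable gets zero mean and unit variance
    Xs = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    ys = (y - y.mean()) / y.std(ddof=1)
    pls = PLSRegression(n_components=n_lv, scale=False).fit(Xs, ys)
    return np.abs(np.ravel(pls.coef_))  # absolute values serve as importance measures
```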
Moreover, it is found that the optimal number of latent variables generated by cross-validation makes a great difference to the regression coefficients, and sometimes the differences can even span several orders of magnitude.34 Thus, the regression coefficients of the same variable in different sub-models may change a great deal and cannot be calculated or compared directly. In this study, we employ normalization to remove this influence. Assume that after building N sub-models, a regression coefficient matrix B (B = [b1, b2, …, bN]T) is generated. The jth row vector of B, denoted by bj (1 ≤ j ≤ N), records the regression coefficients of the jth sub-model. The elements of B are set to 0 if the corresponding variables are not included in the sub-models. The regression coefficient bij of the ith variable in the jth sub-model is normalized as follows:
cij = bij/max(bj)    (3)
In each iterative round, the weight of the ith variable is defined as follows, where si is the sum of the normalized regression coefficients of the ith variable over all sub-models (i.e. the sum of the ith column of the matrix C):
wi = si/max(s), i = 1, 2, …, P    (4)
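The normalization of eqn (3) and the weight calculation of eqn (4) can be summarized in a few lines, as in the sketch below; the function name is illustrative and the input is assumed to be the matrix B of absolute regression coefficients defined above.

```python
import numpy as np

def coefficients_to_weights(B):
    """Eqns (3) and (4): row-wise normalization of B, column sums as criteria, rescaled weights."""
    # B: N x P matrix of absolute regression coefficients; entries of variables
    # that were not included in a sub-model are set to 0
    C = B / B.max(axis=1, keepdims=True)   # eqn (3): c_ij = b_ij / max(b_j)
    s = C.sum(axis=0)                      # criterion s_i of the ith variable
    return s / s.max()                     # eqn (4): w_i = s_i / max(s)
```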
In each iterative round, sequential addition selects a variable subset containing informative variables. Thus, if some important variables are lost by WBMS, they can still be retained in the variable subsets of previous rounds through sequential addition. When selecting the best variable subset among all iterative rounds, these lost variables therefore still have the opportunity to be included in the ultimate variable subset. In this way, the loss of important variables can be avoided; for the same reason, IVSO possesses high stability. Overall, IVSO eliminates variables gradually and gently in a competitive way and reduces the risk of losing important variables.
Step 1: create a binary matrix NM with dimensionality N × P by WBMS for N sampling runs. In each column of NM, there are Nwi entries of '1' and the remaining entries are '0'. If Nwi of a variable is less than 1, the variable is not sampled in any row. Record the number of variables that can be sampled by WBMS, denoted L1.
Step 2: build N PLS sub-models to calculate the regression coefficient matrix B. Each row of the matrix B is normalized to generate the matrix C, as in formula (3).
Step 3: each column of the matrix C is summed to give the criterion of the corresponding variable, denoted by the vector s. Rank the L1 variables according to their criteria.
Step 4: build L1 sub-models through sequential addition according to the ranking. Take the variable subset of the sub-model with the lowest RMSECV value as the objective variable subset of this iterative round. Record this RMSECV value R and the length of this variable subset L2.
Step 5: the vector s is normalized to calculate the weights as in formula (4). The weights obtained in this iterative round are only used for the sampling of the next iterative round.
Step 6: repeat steps 1–5 until L1 is equal to L2; many variable subsets are then obtained. The variable subset with the lowest R value is chosen as the ultimate variable subset of the algorithm.
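Putting the pieces together, the sketch below outlines the whole IVSO loop (steps 1–6) using the helper functions wbms, abs_pls_coefficients, and coefficients_to_weights sketched earlier. The cross-validation routine, the fixed maximum number of latent variables, and the run counts are simplifying assumptions; the authors' exact MATLAB implementation is available in the ESI.

```python
import numpy as np
from sklearn.cross_decomposition import PLSRegression
from sklearn.model_selection import cross_val_predict

def rmsecv(X, y, n_lv, cv=5):
    """Root mean square error of cross-validation for a PLS model on the given variables."""
    pls = PLSRegression(n_components=min(n_lv, X.shape[1]))
    y_hat = np.ravel(cross_val_predict(pls, X, y, cv=cv))
    return float(np.sqrt(np.mean((y_hat - y) ** 2)))

def ivso(X, y, n_lv=10, n_runs=100, ratio=0.8, rng=None):
    rng = np.random.default_rng(rng)
    n, p = X.shape
    weights = np.ones(p)                                # round '0': every variable has weight 1
    best_subset, best_r = np.arange(p), np.inf
    while True:
        nm = wbms(weights, n_runs, rng)                 # step 1: WBMS sampling matrix
        candidates = np.flatnonzero(nm.sum(axis=0))     # the L1 variables that can be sampled
        B = np.zeros((n_runs, p))
        for j in range(n_runs):                         # step 2: N PLS sub-models
            vars_j = np.flatnonzero(nm[j])
            if vars_j.size == 0:
                continue
            obs_j = rng.choice(n, size=int(ratio * n), replace=False)
            B[j, vars_j] = abs_pls_coefficients(X[np.ix_(obs_j, vars_j)], y[obs_j],
                                                min(n_lv, vars_j.size))
        weights = coefficients_to_weights(B)            # steps 3 and 5: criteria and new weights
        order = candidates[np.argsort(-weights[candidates])]
        # step 4: sequential addition - add variables one by one in decreasing importance
        rmse = [rmsecv(X[:, order[:k]], y, n_lv) for k in range(1, order.size + 1)]
        k_best = int(np.argmin(rmse))
        subset, r = order[:k_best + 1], rmse[k_best]
        if r < best_r:                                  # keep the best subset over all rounds
            best_r, best_subset = r, subset
        if candidates.size == subset.size:              # step 6: stop when L1 equals L2
            return best_subset, best_r
```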
Table 1 lists the results of the three methods on the three datasets; the mean and 95% confidence interval are given as well. The simulated dataset is investigated by comparing MC-UVE, CARS, the full spectrum, and the first 100 informative variables. Compared with the full spectrum, using only the first 100 variables drops the RMSEC value from 0.0644 to 0.0091 and the RMSEP value from 0.4043 to 0.0135, even with a smaller number of latent variables (6). This demonstrates the importance and necessity of variable selection in multivariate calibration.
| Methods | nVar^a | nLVs^b | RMSEC Min–Max | RMSEC Average ± CI^c | RMSEP Min–Max | RMSEP Average ± CI |
|---|---|---|---|---|---|---|
| **Simulated** | | | | | | |
| PLS^d | 200 | 10 | | 0.0644 | | 0.4043 |
| PLS^e | 100 | 6 | | 0.0091 | | 0.0135 |
| MC-UVE | 78.2 ± 6.3 | 6 | 0.0087–0.0093 | 0.0089 ± 8.3 × 10−5 | 0.0130–0.0135 | 0.0132 ± 5.8 × 10−5 |
| CARS | 27.6 ± 4.8 | 6.3 ± 0.7 | 0.0080–0.0124 | 0.0100 ± 3.1 × 10−4 | 0.0118–0.0197 | 0.0155 ± 5.3 × 10−4 |
| IVSO | 68.1 ± 2.1 | 6 | 0.0090–0.0093 | 0.0091 ± 1.5 × 10−5 | 0.0120–0.0137 | 0.0125 ± 8.7 × 10−5 |
| **Corn moisture** | | | | | | |
| PLS | 700 | 10 | | 0.017 | | 0.0237 |
| MC-UVE | 70.4 ± 2.6 | 10 | 2.4 × 10−3 to 3.0 × 10−3 | 2.7 × 10−3 ± 4.1 × 10−5 | 2.8 × 10−3 to 3.7 × 10−3 | 3.2 × 10−3 ± 4.0 × 10−5 |
| CARS | 3.4 ± 2.7 | 3.1 ± 1.9 | 2.4 × 10−4 to 2.7 × 10−3 | 4.6 × 10−4 ± 1.8 × 10−4 | 3.4 × 10−4 to 4.5 × 10−3 | 6.4 × 10−4 ± 2.8 × 10−4 |
| IVSO | 2.3 ± 0.8 | 2.3 ± 0.8 | 2.6 × 10−4 to 2.8 × 10−4 | 2.8 × 10−4 ± 1.1 × 10−6 | 3.3 × 10−4 to 3.6 × 10−4 | 3.4 × 10−4 ± 1.4 × 10−6 |
| **Wheat protein** | | | | | | |
| PLS | 175 | 10 | | 0.3923 | | 0.2382 |
| MC-UVE | 10.6 ± 1.3 | 9.9 ± 0.3 | 0.3370–0.3657 | 0.3475 ± 0.0012 | 0.2466–0.2791 | 0.2532 ± 0.0027 |
| CARS | 9.8 ± 2.8 | 8.2 ± 1.3 | 0.2501–0.3427 | 0.2969 ± 0.0054 | 0.1818–0.3535 | 0.2432 ± 0.0111 |
| IVSO | 14.8 ± 3.0 | 7.5 ± 1.0 | 0.2415–0.2695 | 0.2641 ± 0.0030 | 0.2313–0.2363 | 0.2339 ± 0.0009 |

^a The number of selected variables. ^b The number of selected latent variables of PLS. ^c 95% confidence interval (CI). ^d Results using the full spectrum with 200 variables by PLS. ^e Results using only the first 100 informative variables by PLS.
The statistical features of the results obtained by the different methods can be observed visually in the boxplots of Fig. 4, in which Fig. 4A displays the results on the simulated dataset. As can be seen from Table 1 and Fig. 4A, IVSO shows the best performance with regard to both improving the prediction ability of the model and maintaining good stability. The 95% confidence intervals of the RMSEC and RMSEP results show that IVSO has no overlap with the other methods. In addition, the number of latent variables selected by IVSO is 6, which is much smaller than that of the full spectrum.
The frequency distribution of the selected variables over 50 runs is displayed in Fig. 5. For all methods the selected variables concentrate in the first 100 variables. Both IVSO and MC-UVE can select variables with high frequencies. However, the variables selected by CARS have low frequencies, and not even one variable is selected in all 50 runs, which reveals its instability. This fact is consistent with its large confidence interval in Table 1 and its standard deviation in Fig. 4A.
Fig. 5 The frequencies of variables selected by different methods within 50 runs on the simulated dataset. (A) MC-UVE; (B) CARS; (C) IVSO.
Fig. 6A and B show the changing trend of the number of variables sampled by IVSO and CARS, respectively. The arrow indicates the point at which the optimal variable subset is reached. As MC-UVE uses the full spectrum to establish its sub-models, no iterative rounds occur for it. For the simulated dataset in Fig. 6A, the number of sampled variables decreases to 100 in the 3rd and 4th iterative rounds, and then the curve drops much more slowly. In stark contrast, the number of variables sampled for the simulated dataset in Fig. 6B varies tremendously in the early part of the curve. In the first iterative round alone the number decreases to 97, which means that even the first round removes not only uninformative but also informative variables. In the 11th iterative round CARS reaches its optimal variable subset containing only 26 variables. It can be concluded that CARS eliminates variables too quickly and thus loses informative variables, which may result from the enforced elimination of variables by the EDF. On the contrary, IVSO eliminates uninformative variables gradually and gently, and achieves much higher stability.
Fig. 7 shows the RMSECV value of the variable subset chosen by sequential addition in each iterative round, corresponding to the sampling curve of the simulated dataset in Fig. 6A. The '0' iterative round stands for the process in which the weights of all variables are set to '1' as initial values and all variables are subjected to sequential addition. In Fig. 7A, the RMSECV value of the simulated dataset drops at first because uninformative variables still exist and are being removed, and begins to rise again when informative variables start to be lost. This demonstrates the good ability of IVSO to eliminate uninformative variables while keeping informative ones.
The frequencies of the variables selected by these methods are displayed in Fig. 8. Both CARS and IVSO mainly select the variables at 1908 nm and 2108 nm, which have been discussed and proven experimentally and theoretically to be the key wavelengths in the CARS literature.27 These two wavelengths are related to water absorption and the combination band of the O–H bond.13 CARS, however, cannot select the key wavelength at 2108 nm in every run. MC-UVE, in contrast, selects too many other variables with high frequencies besides these two key variables.
Fig. 8 The frequencies of variables selected by different methods within 50 runs on the corn moisture dataset. (A) MC-UVE; (B) CARS; (C) IVSO.
From the corn moisture dataset in Fig. 6A and B, we can also see that the number of variables sampled by IVSO in the early rounds drops much more gradually and gently than that of CARS. In the later rounds, although this number changes more quickly for IVSO, the key variables at 1908 nm and 2108 nm can still be retained in every iterative round owing to sequential addition, whereas CARS cannot achieve this.
In Fig. 7B, IVSO first reaches the optimal variable subset containing the two key variables in the 4th iterative round. The optimal variable subset then remains unchanged, so the RMSECV value is stable. From the RMSECV values, we can conclude that the strategy of sequential addition used in every iterative round makes the result stable.
Fig. 9 The frequencies of variables selected by different methods within 50 runs on the wheat protein dataset. (A) MC-UVE; (B) CARS; (C) IVSO.
Although IVSO is coupled with partial least squares (PLS) to select variables in this study, it can also be combined with other regression or pattern recognition methods. Our future work will focus on investigating the application of IVSO in other fields, such as metabolomics and quantitative structure–activity relationship (QSAR) studies.
Footnote
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5ra08455e