Investigation of sample partitioning in quantitative near-infrared analysis of soil organic carbon based on parametric LS-SVR modeling

Hua-Zhou Chen; Kai Shi; Ken Cai; Li-Li Xu; Quan-Xi Feng

doi:10.1039/C5RA12468A

Investigation of sample partitioning in quantitative near-infrared analysis of soil organic carbon based on parametric LS-SVR modeling

Hua-Zhou Chen,^a Kai Shi,^a Ken Cai,*^b Li-Li Xu^c and Quan-Xi Feng^a

Author affiliations

* Corresponding authors

^a College of Science, Guilin University of Technology, Guilin 541004, China

^b School of Information Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
E-mail: kencaizhku@foxmail.com

^c School of Ocean, Qinzhou University, Qinzhou, China

Abstract

Soil organic carbon (SOC) can be quantitatively determined with the enhanced stability of near-infrared (NIR) measurements. NIR analysis requires a modeling-validation division for real samples. The research of modeling robustness should be discussed in the modeling process based on an investigation of the calibration–prediction sample partitioning. A framework for sample partitioning is proposed with the consideration of the tunable ratio of numbers of the calibration and prediction samples. We addressed this issue in the multivariate calibration for NIR analysis of SOC using the least squares support vector machine regression (LS-SVR) method with an interactive grid search of its two modeling parameters, γ and σ, where γ is the regularization parameter directly influencing the Lagrange multiplier in the kernel transformation, and σ² represents the kernel width used to tune the degree of generalization. We created 7 volunteer groups for different ratios of calibration–prediction partitions. The calibration and prediction samples were re-produced for each volunteer group. The LS-SVR models were established and the parameters were optimally selected by considering the stability and robustness based on the statistical theory of mean value and relative standard deviation. Furthermore, in all comparative partition ratios, the optimal volunteer group was selected with the partition of 65 calibration samples and 35 prediction samples. Consequently, the optimized calibration model with correspondent optimal volunteer group was evaluated by the independent validation samples. The optimal LS-SVR parameters (γ, σ) were (110, 7), and the validation results revealed a root mean square error of 0.302 and a correlation coefficient of 0.907. This validation effect was considerably satisfactory for the random validation samples because an optimal volunteer group was chosen for calibration–prediction partition to guarantee the modeling stability and robustness in the process of model optimization.

Article information

https://doi.org/10.1039/C5RA12468A

Article type

Paper

Submitted

27 Jun 2015

Accepted

02 Sep 2015

First published

04 Sep 2015

Download Citation

RSC Adv., 2015,5, 80612-80619

Permissions

Request permissions

Investigation of sample partitioning in quantitative near-infrared analysis of soil organic carbon based on parametric LS-SVR modeling

H. Chen, K. Shi, K. Cai, L. Xu and Q. Feng, RSC Adv., 2015, 5, 80612 DOI: 10.1039/C5RA12468A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

RSC Advances

Investigation of sample partitioning in quantitative near-infrared analysis of soil organic carbon based on parametric LS-SVR modeling

Abstract

Article information

Download Citation

Author version available

Permissions

Investigation of sample partitioning in quantitative near-infrared analysis of soil organic carbon based on parametric LS-SVR modeling

Social activity

Search articles by author

Spotlight

Advertisements