Optica Publishing Group
Journal of Near Infrared Spectroscopy, Vol. 18, Issue 4, pp. 231-237 (2010)

The Importance of Choosing the Right Validation Strategy in Inverse Modelling

Abstract

Inverse modelling techniques, such as principal component regression, partial least squares regression and support vector machines, are powerful multivariate calibration strategies that are widely used in near infrared spectroscopy. However, these techniques are so efficient at finding correlations between the spectral variables and the parameter to be predicted that a proper validation strategy is needed to avoid over-optimistic results. In this study, different validation strategies were investigated on a dataset that was acquired over several measurement days. The goal was to predict albumen freshness from spectral measurements. Validation procedures frequently applied in practice, i.e. 10-fold cross-validation (10-fold CV) and validation based on random subdivision into calibration and validation sets (RS), were compared with cross-validation across measurement days (MD). Whereas 10-fold CV and RS validation suggested that prediction of albumen freshness is possible, MD validation on the same dataset indicated that albumen freshness cannot be predicted from the spectral measurements. It is shown that inverse modelling is very sensitive to non-specific correlations between the spectral measurements and the dependent variable, which may be artifacts of the measurement protocol and will not persist in future measurements. Therefore, selecting the right validation strategy for a given application and critically evaluating the obtained results are crucial steps in inverse modelling to obtain useful calibration models. More specifically, in the context of process analytical technology, where spectra are acquired over time, great care should be taken to break the non-specific correlation between the dependent variable and the variations in the spectral measurements over time.
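
The contrast between random and day-wise validation described in the abstract can be illustrated with a minimal sketch on synthetic data. It does not reproduce the study's dataset or models. The assumptions are all illustrative: spectra carry a day-specific shift but no direct information about the reference value, the reference value itself differs between days, and ridge regression stands in for the inverse models named above (PCR, PLS, SVM). The function names, parameter values, and the use of NumPy are hypothetical choices made for this example.

```python
import numpy as np


def make_confounded_data(n_days=8, n_per_day=20, n_wavelengths=60, seed=0):
    """Synthetic spectra whose only link to y is the measurement day (assumption)."""
    rng = np.random.default_rng(seed)
    days = np.repeat(np.arange(n_days), n_per_day)
    # Day-specific spectral shift, e.g. instrument or ambient drift.
    day_shift = rng.normal(0.0, 1.0, size=(n_days, n_wavelengths))
    X = day_shift[days] + rng.normal(0.0, 0.2, size=(days.size, n_wavelengths))
    # The reference value differs by day but is otherwise unrelated to the spectra.
    day_level = np.linspace(-2.0, 2.0, n_days)
    y = day_level[days] + rng.normal(0.0, 0.3, size=days.size)
    return X, y, days


def ridge_fit_predict(X_train, y_train, X_test, alpha=1.0):
    """Ridge regression, used here as a simple stand-in for PLS or PCR."""
    x_mean = X_train.mean(axis=0)
    y_mean = y_train.mean()
    Xc = X_train - x_mean
    gram = Xc.T @ Xc + alpha * np.eye(Xc.shape[1])
    w = np.linalg.solve(gram, Xc.T @ (y_train - y_mean))
    return (X_test - x_mean) @ w + y_mean


def cross_val_r2(X, y, folds):
    """Pooled R^2 of out-of-fold predictions; `folds` is a list of test-index arrays."""
    y_pred = np.empty_like(y)
    for test_idx in folds:
        train_mask = np.ones(y.size, dtype=bool)
        train_mask[test_idx] = False
        y_pred[test_idx] = ridge_fit_predict(X[train_mask], y[train_mask], X[test_idx])
    ss_res = np.sum((y - y_pred) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


X, y, days = make_confounded_data()

# Random 10-fold CV: every measurement day appears in both calibration and test folds.
perm = np.random.default_rng(1).permutation(y.size)
random_folds = np.array_split(perm, 10)

# Day-wise CV: each fold holds out all samples from one measurement day.
day_folds = [np.where(days == d)[0] for d in np.unique(days)]

r2_random = cross_val_r2(X, y, random_folds)
r2_by_day = cross_val_r2(X, y, day_folds)
print(f"random 10-fold CV R^2: {r2_random:.2f}")
print(f"leave-one-day-out R^2: {r2_by_day:.2f}")
```

In this construction the random folds let the model recognise the day from the spectral shift and look up that day's reference level, so the score looks favourable, while holding out whole days removes that shortcut. With real data, the grouping variable would be whatever factor is confounded with the dependent variable, such as measurement day, batch, or instrument.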

© 2010 IM Publications LLP
