Model choice and squared prediction errors in PLS regression

Rolf Ergon, Maths Halstensen, Kim H. Esbensen

Research output: Contribution to journalArticleResearchpeer-review

2 Citations (Scopus)

Abstract

Squared prediction errors (SPE) in X are discussed in relation to the conventional PLSR versus bidiagonalization model and algorithm issue concerning residual and prediction consistency, with focus on process monitoring and fault detection. Our analysis leads to the conclusion that conventional PLSR based on the NIPALS algorithm is ambiguous in SPE values caused by process faults. The basic reason for this is that the sample residuals are not found as projections onto the orthogonal complement of the space where the scores and regression solution are located, and where also the statistical T 2 limit is defined. The alternative non-orthogonalized PLSR and bidiagonalization (Bidiag2) algorithms, as well as a simple re-formulation of the NIPALS algorithm (RE-PLSR), give unambiguous SPE values, and the last two of these also retain orthogonal score vectors. While prediction results from all of these methods in theory are identical, our conclusion is that methods where the T 2 and SPE values for process faults are uncorrelated should be preferred. Tests with added y errors on real data do not indicate that this conclusion should be altered because of such errors.

Original languageEnglish
Pages (from-to)301-312
Number of pages12
JournalJournal of Chemometrics
Volume25
Issue number6
DOIs
Publication statusPublished - Jun 2011

Keywords

  • NIPALS
  • Partial least squares regression
  • Process monitoring
  • Residual consistency
  • SPE plots
  • Squared prediction errors in X

Programme Area

  • Programme Area 3: Energy Resources

Fingerprint

Dive into the research topics of 'Model choice and squared prediction errors in PLS regression'. Together they form a unique fingerprint.

Cite this