Questions — Chemometrics: Multivariate Calibration and Data Analysis

Question 1 Multiple Choice

A chemist builds a PLS model for predicting glucose in blood plasma from near-IR spectra, using 15 latent variables. The model predicts the training set with excellent accuracy but performs poorly on new patient samples. What is the most likely explanation?

ANear-IR spectroscopy is inherently too insensitive for glucose in complex biological matrices

BThe model has overfit the training data by including too many latent variables, learning noise rather than true chemical signal

CPLS regression is not appropriate for biological samples with variable composition

DThe training set was too large, which reduced the model's sensitivity to individual samples

Question 2 Multiple Choice

Why is PLS regression preferred over PCA for building quantitative concentration prediction models in chemometrics?

APCA is computationally too expensive for large spectral datasets

BPCA is a supervised method that already incorporates concentration information

CPLS finds latent variables that simultaneously capture spectral variance AND correlate with target concentration; PCA is unsupervised and may find variance directions unrelated to the analyte

DPLS requires fewer calibration standards than PCA to build a reliable model

Question 3 True / False

Adding more spectral variables (wavelengths) to a chemometric calibration model usually improves prediction accuracy because more information is generally beneficial.

TTrue

FFalse

Question 4 True / False

Cross-validation is essential in chemometric model building because it provides an unbiased estimate of prediction performance on new samples and helps identify the appropriate number of latent variables.

TTrue

FFalse

Question 5 Short Answer

What fundamental limitation of univariate calibration does multivariate calibration (e.g., PLS) overcome, and what new risk does it introduce?

Think about your answer, then reveal below.

Questions: Chemometrics: Multivariate Calibration and Data Analysis