Questions: Mixture Models and Latent Class Analysis in Testing
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A psychometrician runs both an IRT model and a latent class analysis on the same achievement test. The IRT model estimates each student's position on an ability continuum; the LCA identifies three latent classes. What fundamentally different question does LCA answer that IRT cannot?
AHow many items the test needs to achieve adequate reliability
BWhether there are qualitatively distinct subpopulations of test-takers rather than a single continuous distribution of ability
CHow internally consistent the test items are with one another
DWhether the test has construct validity relative to an external criterion
IRT is variable-centered: it asks 'where does this person fall on the ability continuum?' LCA is person-centered: it asks 'what type of person is this?' These are not competing answers to the same question — they describe different kinds of structure. IRT assumes everyone is on the same continuum; LCA asks whether the population actually consists of qualitatively different subgroups (e.g., careful readers vs. fast guessers) that may require entirely different measurement models.
Question 2 Multiple Choice
A researcher adds more latent classes to a mixture model and finds that model fit as measured by AIC keeps improving with each additional class. What is the correct response to this finding?
AKeep adding classes until AIC stops improving — that number is the true number of latent subpopulations
BConclude the data have no meaningful latent class structure since the model never stabilizes
CBalance statistical fit indices against the interpretability, replicability, and external validity of the classes — fit alone does not determine the right number of classes
DSwitch to IRT, which avoids the problem of model selection entirely
Model fit statistics like AIC and BIC will often keep improving with more classes because more classes always explain more variance. This is why statistical fit must be balanced against substantive criteria: Are the classes interpretable and meaningful? Do they replicate in independent samples? Do they predict external variables (like treatment response) in expected ways? Choosing K based purely on fit statistics can produce statistically optimal but substantively meaningless classes.
Question 3 True / False
In a latent class model, the assumption of local independence within classes means that once you know a respondent's class membership, their responses to individual items are negatively correlated.
TTrue
FFalse
Answer: False
Local independence within classes means responses are *statistically independent* — not correlated in either direction — once class membership is known. The logic is that class membership explains all the correlations among items. This mirrors the local independence assumption in IRT, where ability explains all item correlations. If items remain correlated within a class, that's a sign the model needs more classes or that a continuous dimension exists within the class.
Question 4 True / False
The output of a latent class analysis assigns each respondent a vector of probabilities of belonging to each class, rather than a definitive class membership.
TTrue
FFalse
Answer: True
LCA estimates posterior probabilities: each respondent gets a probability of belonging to each class (e.g., 0.78 probability of Class 1, 0.22 probability of Class 2). Researchers often assign people to their most probable class for descriptive purposes, but this 'hard assignment' discards uncertainty. Formal analyses should propagate the full probability vector to avoid treating uncertain classifications as certain — a point that distinguishes mixture modeling from simple group-comparison designs.
Question 5 Short Answer
What is the fundamental difference between a variable-centered approach like IRT and a person-centered approach like latent class analysis? When would you choose one over the other?
Think about your answer, then reveal below.
Model answer: Variable-centered approaches ask 'where does this person fall on the trait continuum?' and model individual differences as a matter of degree. Person-centered approaches ask 'what kind of person is this?' and model individual differences as qualitative types. Choose IRT when variation is expected to be continuous and gradational (e.g., a general ability dimension). Choose LCA when you suspect qualitatively distinct subgroups exist — different response strategies, distinct symptom profiles, or identifiable misconception types — that shouldn't be collapsed onto a single dimension.
The two approaches are not mutually exclusive. A population can have both continuous within-class variation and discrete between-class differences — this is modeled by mixture IRT. The choice depends on the substantive question: 'How much?' calls for IRT; 'What type?' calls for LCA. A practical implication: if you fit IRT to a sample that actually contains two qualitatively different groups (e.g., motivated and unmotivated test-takers), your IRT parameters will be biased averages that describe neither group accurately. Mixture modeling detects and corrects for this hidden heterogeneity.