Questions: Mixture Models and Gaussian Mixture Models

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

Two clusters overlap significantly in a 2D dataset. A GMM and k-means are both fitted with K=2. For a point located exactly between the two cluster centers, which statement best describes the difference in outputs?

AK-means and GMM give identical cluster assignments; they differ only in how they handle outliers
BK-means assigns the point to exactly one cluster; the GMM assigns it fractional membership in both components, reflecting genuine uncertainty
CThe GMM assigns the point to the component with the higher covariance; k-means uses only distance to the nearest centroid
DThe GMM will identify more clusters than k-means in overlapping regions
Question 2 Multiple Choice

Why is the EM algorithm used to fit GMMs rather than directly maximizing the likelihood?

AEM is faster because it applies gradient descent to the log-likelihood using automatic differentiation
BDirect optimization is intractable because the log of a sum of Gaussian densities has no closed-form solution; EM sidesteps this by alternating between computing responsibilities and updating parameters
CThe GMM likelihood surface is convex, and EM is the standard solver for convex problems
DEM imposes hard assignments in the E-step that make the M-step analytically tractable by reducing to a k-means update
Question 3 True / False

K-means clustering is a special case of Gaussian Mixture Models where the covariance of each component is fixed as the identity matrix and responsibilities are forced to be 0 or 1.

TTrue
FFalse
Question 4 True / False

A GMM with K components assigns each data point to the single component whose mean is closest to that point, analogous to k-means centroid assignment.

TTrue
FFalse
Question 5 Short Answer

What does it mean for a GMM to perform 'density estimation,' and why is k-means fundamentally incapable of this?

Think about your answer, then reveal below.