Questions — Mixture Models and Gaussian Mixture Models

Question 1 Multiple Choice

Two clusters overlap significantly in a 2D dataset. A GMM and k-means are both fitted with K=2. For a point located exactly between the two cluster centers, which statement best describes the difference in outputs?

AK-means and GMM give identical cluster assignments; they differ only in how they handle outliers

BK-means assigns the point to exactly one cluster; the GMM assigns it fractional membership in both components, reflecting genuine uncertainty

CThe GMM assigns the point to the component with the higher covariance; k-means uses only distance to the nearest centroid

DThe GMM will identify more clusters than k-means in overlapping regions

Question 2 Multiple Choice

Why is the EM algorithm used to fit GMMs rather than directly maximizing the likelihood?

AEM is faster because it applies gradient descent to the log-likelihood using automatic differentiation

BDirect optimization is intractable because the log of a sum of Gaussian densities has no closed-form solution; EM sidesteps this by alternating between computing responsibilities and updating parameters

CThe GMM likelihood surface is convex, and EM is the standard solver for convex problems

DEM imposes hard assignments in the E-step that make the M-step analytically tractable by reducing to a k-means update

Question 3 True / False

K-means clustering is a special case of Gaussian Mixture Models where the covariance of each component is fixed as the identity matrix and responsibilities are forced to be 0 or 1.

TTrue

FFalse

Question 4 True / False

A GMM with K components assigns each data point to the single component whose mean is closest to that point, analogous to k-means centroid assignment.

TTrue

FFalse

Question 5 Short Answer

What does it mean for a GMM to perform 'density estimation,' and why is k-means fundamentally incapable of this?

Think about your answer, then reveal below.

Questions: Mixture Models and Gaussian Mixture Models