5 questions to test your understanding
Two clusters overlap significantly in a 2D dataset. A GMM and k-means are both fitted with K=2. For a point located exactly between the two cluster centers, which statement best describes the difference in outputs?
Why is the EM algorithm used to fit GMMs rather than directly maximizing the likelihood?
K-means clustering is a special case of Gaussian Mixture Models where the covariance of each component is fixed as the identity matrix and responsibilities are forced to be 0 or 1.
A GMM with K components assigns each data point to the single component whose mean is closest to that point, analogous to k-means centroid assignment.
What does it mean for a GMM to perform 'density estimation,' and why is k-means fundamentally incapable of this?