Questions: K-Means Clustering

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

K-Means is run twice on the same dataset with different random initializations and produces two different final cluster assignments. What does this most likely indicate?

AThe data contains no meaningful cluster structure, so K-Means returns arbitrary results
BK-Means has converged to different local minima of the within-cluster sum of squared distances
CThe value of k is incorrect and should be adjusted
DK-Means always produces random output and cannot be trusted for analysis
Question 2 Multiple Choice

A dataset contains two elongated, crescent-shaped clusters that curve around each other. K-Means with k=2 consistently fails to separate them, instead splitting each crescent roughly in half. What is the fundamental reason for this failure?

AK-Means needs more iterations to discover curved cluster boundaries
BThe features require standardization before K-Means can handle non-circular shapes
CK-Means assigns points to the nearest centroid using Euclidean distance, implicitly assuming spherical clusters — curved or irregular shapes violate this assumption
Dk=2 is too small; increasing k would resolve the problem
Question 3 True / False

Each iteration of the K-Means algorithm is guaranteed to reduce or maintain the total within-cluster sum of squared distances.

TTrue
FFalse
Question 4 True / False

The elbow method identifies the optimal value of k by selecting the k that produces the lowest inertia.

TTrue
FFalse
Question 5 Short Answer

Why does K-Means++ improve on random initialization, and what property of the initial centroids does it aim to achieve?

Think about your answer, then reveal below.