Questions — K-Means Clustering — Open Knowledge Graph

Question 1 Multiple Choice

K-Means is run twice on the same dataset with different random initializations and produces two different final cluster assignments. What does this most likely indicate?

AThe data contains no meaningful cluster structure, so K-Means returns arbitrary results

BK-Means has converged to different local minima of the within-cluster sum of squared distances

CThe value of k is incorrect and should be adjusted

DK-Means always produces random output and cannot be trusted for analysis

Question 2 Multiple Choice

A dataset contains two elongated, crescent-shaped clusters that curve around each other. K-Means with k=2 consistently fails to separate them, instead splitting each crescent roughly in half. What is the fundamental reason for this failure?

AK-Means needs more iterations to discover curved cluster boundaries

BThe features require standardization before K-Means can handle non-circular shapes

CK-Means assigns points to the nearest centroid using Euclidean distance, implicitly assuming spherical clusters — curved or irregular shapes violate this assumption

Dk=2 is too small; increasing k would resolve the problem

Question 3 True / False

Each iteration of the K-Means algorithm is guaranteed to reduce or maintain the total within-cluster sum of squared distances.

TTrue

FFalse

Question 4 True / False

The elbow method identifies the optimal value of k by selecting the k that produces the lowest inertia.

TTrue

FFalse

Question 5 Short Answer

Why does K-Means++ improve on random initialization, and what property of the initial centroids does it aim to achieve?

Think about your answer, then reveal below.

Questions: K-Means Clustering