Questions — Hierarchical Clustering

Question 1 Multiple Choice

A data scientist wants clusters that are compact and roughly spherical. Which linkage criterion should they prefer for agglomerative hierarchical clustering?

ASingle linkage, because it minimizes the distance between the nearest points in each cluster pair

BComplete linkage, because using the maximum pairwise distance to define inter-cluster distance tends to produce compact, bounded clusters

CAny linkage criterion produces identical cluster shapes for spherical data

DWard's method cannot be used here because it assumes clusters are normally distributed

Question 2 Multiple Choice

A colleague claims: 'Hierarchical clustering is strictly better than K-means because it automatically determines the correct number of clusters.' What is the most accurate assessment of this claim?

AFully correct — hierarchical clustering objectively determines K from the dendrogram structure

BPartially correct — hierarchical clustering avoids specifying K during computation, but a human must still choose where to cut the dendrogram, which is its own subjective decision

CIncorrect — both methods require you to specify K before running

DIncorrect — K-means also builds a hierarchical structure if you run it with multiple values of K

Question 3 True / False

Hierarchical clustering can reveal whether a dataset has two clear clusters, five, or a continuum — information that a flat K-means analysis cannot easily provide from a single run.

TTrue

FFalse

Question 4 True / False

Single linkage hierarchical clustering produces the most compact, evenly sized clusters and is the best default choice for general-purpose clustering tasks.

TTrue

FFalse

Question 5 Short Answer

You have a dataset of 50,000 observations and want to explore its cluster structure. Why might agglomerative hierarchical clustering be impractical, and what would you do instead?

Think about your answer, then reveal below.

Questions: Hierarchical Clustering