Questions: Hierarchical Clustering

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A data scientist wants clusters that are compact and roughly spherical. Which linkage criterion should they prefer for agglomerative hierarchical clustering?

ASingle linkage, because it minimizes the distance between the nearest points in each cluster pair
BComplete linkage, because using the maximum pairwise distance to define inter-cluster distance tends to produce compact, bounded clusters
CAny linkage criterion produces identical cluster shapes for spherical data
DWard's method cannot be used here because it assumes clusters are normally distributed
Question 2 Multiple Choice

A colleague claims: 'Hierarchical clustering is strictly better than K-means because it automatically determines the correct number of clusters.' What is the most accurate assessment of this claim?

AFully correct — hierarchical clustering objectively determines K from the dendrogram structure
BPartially correct — hierarchical clustering avoids specifying K during computation, but a human must still choose where to cut the dendrogram, which is its own subjective decision
CIncorrect — both methods require you to specify K before running
DIncorrect — K-means also builds a hierarchical structure if you run it with multiple values of K
Question 3 True / False

Hierarchical clustering can reveal whether a dataset has two clear clusters, five, or a continuum — information that a flat K-means analysis cannot easily provide from a single run.

TTrue
FFalse
Question 4 True / False

Single linkage hierarchical clustering produces the most compact, evenly sized clusters and is the best default choice for general-purpose clustering tasks.

TTrue
FFalse
Question 5 Short Answer

You have a dataset of 50,000 observations and want to explore its cluster structure. Why might agglomerative hierarchical clustering be impractical, and what would you do instead?

Think about your answer, then reveal below.