Questions — DBSCAN Clustering — Open Knowledge Graph

Question 1 Multiple Choice

A dataset consists of two interlocking crescent shapes. You run both k-means (k=2) and DBSCAN on it. What do you expect?

ABoth algorithms correctly identify the two crescents as separate clusters

BK-means succeeds because it finds the two natural groups; DBSCAN fails because it cannot handle curved shapes

CDBSCAN correctly identifies the two crescents; k-means fails because it assumes spherical clusters and cannot separate interlocking shapes

DNeither algorithm can handle this dataset without feature engineering

Question 2 Multiple Choice

You run DBSCAN on a dataset and nearly every point is labeled as noise. What is the most likely cause?

Aeps is too large, causing all points to merge into one cluster

Bmin_pts is set to 1, making every point its own cluster

Ceps is too small, so most points don't have enough neighbors within the radius to qualify as core points

DThe dataset has too many dimensions for DBSCAN to function

Question 3 True / False

DBSCAN can identify clusters of arbitrary shape because it defines clusters based on density-connectivity rather than distance to a cluster centroid.

TTrue

FFalse

Question 4 True / False

In DBSCAN, nearly every data point is assigned to exactly one cluster — points that don't fit well are assigned to the nearest cluster as border points.

TTrue

FFalse

Question 5 Short Answer

Why does DBSCAN require two parameters (eps and min_pts) rather than one, and what aspect of cluster structure does each control?

Think about your answer, then reveal below.

Questions: DBSCAN Clustering