Questions — Sublinear Algorithms — Open Knowledge Graph

Question 1 Multiple Choice

A sublinear-time algorithm estimates the number of edges in a graph given in adjacency list representation. It samples s random vertices, queries their degrees, and returns n * (average sampled degree) / 2. How large must s be for epsilon-relative error?

As = O(1/epsilon) regardless of the graph structure

Bs = O(sqrt(n) / epsilon^2) — the variance depends on the degree distribution and sqrt(n) is needed because high-degree vertices dominate but are rare

Cs = O(n / epsilon) — you need to sample a constant fraction

Ds = O(log n / epsilon^2) — concentration inequalities handle everything

Question 2 True / False

Every decision problem that can be solved exactly in O(n) time can also be solved approximately (with one-sided error) in o(n) time.

TTrue

FFalse

Question 3 True / False

Sublinear-time algorithms must use randomization — no deterministic sublinear-time algorithm can provide useful approximate answers.

TTrue

FFalse

Question 4 Short Answer

Explain the local-to-global principle that underlies sublinear algorithms: why can global properties of massive datasets sometimes be determined from small random samples?

Think about your answer, then reveal below.

Questions: Sublinear Algorithms