Questions — Merkle Trees for Distributed Data Consistency

Question 1 Multiple Choice

Two replicas each store 1,024 data blocks organized in a Merkle tree of depth 10. They find their root hashes differ. If only one block differs, what is the maximum number of hash comparisons needed to identify it?

A1,024 — all leaves must be compared to find the mismatched one

B512 — binary search halves the space in one step

CAbout 10 — one pair of children compared per level until the differing leaf is reached

D2 — the root comparison plus one final leaf check

Question 2 Multiple Choice

Two replicas compute their Merkle trees over the same dataset and find that their root hashes match. What can they conclude?

AThe datasets are probably the same, but minor differences could be masked by hash collisions and should be double-checked

BThe datasets are identical with cryptographic certainty (barring hash collisions negligible in practice)

COnly the highest levels of data are guaranteed the same; leaf-level differences might still exist

DThe trees were built consistently but the datasets could still differ due to the birthday paradox at scale

Question 3 True / False

Merkle trees eliminate the CPU cost of consistency checking because hashes can be computed without reading the underlying data.

TTrue

FFalse

Question 4 True / False

A Merkle tree allows two replicas to locate data differences in O(log n) hash exchanges rather than O(n) data transfers, making anti-entropy far more bandwidth-efficient.

TTrue

FFalse

Question 5 Short Answer

Explain why the Merkle tree comparison is described as a 'logarithmic search.' What structural property of the tree enables this efficiency, and when does the approach provide the greatest bandwidth savings?

Think about your answer, then reveal below.

Questions: Merkle Trees for Distributed Data Consistency