5 questions to test your understanding
Two replicas each store 1,024 data blocks organized in a Merkle tree of depth 10. They find their root hashes differ. If only one block differs, what is the maximum number of hash comparisons needed to identify it?
Two replicas compute their Merkle trees over the same dataset and find that their root hashes match. What can they conclude?
Merkle trees eliminate the CPU cost of consistency checking because hashes can be computed without reading the underlying data.
A Merkle tree allows two replicas to locate data differences in O(log n) hash exchanges rather than O(n) data transfers, making anti-entropy far more bandwidth-efficient.
Explain why the Merkle tree comparison is described as a 'logarithmic search.' What structural property of the tree enables this efficiency, and when does the approach provide the greatest bandwidth savings?