Explain how Fano's inequality is used to prove the converse of the channel coding theorem — that reliable communication above channel capacity is impossible.
Think about your answer, then reveal below.
Model answer: Consider transmitting one of 2^(nR) messages over n uses of a channel with capacity C. The message M is estimated as M-hat from the channel output Y^n. Fano's inequality gives H(M|Y^n) <= 1 + P_e * nR. Meanwhile, the data processing inequality and the capacity definition give I(M; Y^n) <= nC. Since I(M; Y^n) = H(M) - H(M|Y^n) = nR - H(M|Y^n), combining yields nR - 1 - P_e * nR <= nC. If R > C, then for large n, P_e must be bounded away from zero — the error probability cannot vanish. Therefore, reliable communication at rate R > C is impossible.
The converse proof chains together Fano's inequality (connecting error probability to conditional entropy), the definition of mutual information (connecting conditional entropy to information rate), and the channel capacity bound (capping mutual information per channel use). Each link is individually simple; their composition yields the profound result that capacity is a hard ceiling.