The probabilistic method, pioneered by Erdos, proves the existence of combinatorial objects with desired properties by showing that a random object has the property with positive probability. If Pr[X has property P] > 0, then an object with property P must exist. The first moment method (linearity of expectation) shows that if the expected number of "bad" substructures is less than 1, a good object exists. The second moment method (Chebyshev/Paley-Zygmund) strengthens this by showing concentration. The alteration method generates a random object, then deterministically fixes any defects. These techniques yield otherwise-inaccessible bounds on Ramsey numbers, chromatic numbers, set systems, and circuit complexity, and connect directly to derandomization when the probabilistic argument can be made constructive.
The probabilistic method is one of the most powerful techniques in combinatorics and theoretical computer science. Its fundamental insight is deceptively simple: to prove that an object with property P exists, show that a random object has property P with positive probability. If you draw from a well-chosen probability distribution and the probability of success is positive, then a successful object must exist in the sample space — even if you cannot explicitly construct it. Erdos developed this method into a systematic tool throughout the mid-20th century, obtaining results that no deterministic construction technique has matched.
The first moment method (or expectation argument) is the most basic version. To show that a graph property holds for some graph, define X as the number of "bad" substructures in a random graph, compute E[X] using linearity of expectation, and show E[X] < 1. Since X is a non-negative integer with mean less than 1, it must be 0 for at least one outcome. Erdos's Ramsey bound is the iconic example: the expected number of monochromatic k-cliques in a random 2-coloring of K_n is C(n,k) * 2^(1-C(k,2)), which drops below 1 when n < 2^(k/2). Therefore R(k,k) > 2^(k/2). No explicit construction achieves more than 2^(c*sqrt(k*log k)) — the probabilistic method gives exponentially better bounds than any known construction.
The alteration method extends the first moment approach by allowing a cleanup phase. Generate a random structure that almost has the desired property, then deterministically fix the defects. For maximum independent set: include each vertex with probability p, then delete one endpoint from each surviving edge. The expected independent set size is np - mp^2 (included vertices minus deletions), optimized at p = n/(2m). This yields alpha(G) >= n/(2d_avg), the Turan bound. The alteration step is crucial — the random set is not independent, but the deterministic deletion makes it one while preserving most of the randomly selected vertices. The technique extends to hypergraph coloring, satisfiability, and discrepancy theory.
The second moment method provides a qualitative leap: instead of just proving existence (E[X] > 0 implies X > 0 sometimes), it proves that X is concentrated around its mean. Using the Paley-Zygmund inequality, Pr[X > 0] >= (E[X])^2 / E[X^2], bounding the second moment shows that X is positive with substantial (even high) probability. The key challenge is bounding E[X^2] = E[X]^2 + Var(X): the variance must be shown to be small relative to the square of the mean. This requires carefully analyzing the covariance structure — for indicator variables X = sum I_i, Var(X) = sum Cov(I_i, I_j), and the "diagonal" terms (i = j) contribute E[X], while the "off-diagonal" terms capture dependencies. The second moment method is essential for proving threshold phenomena in random graphs and random constraint satisfaction problems.
The connection to algorithm design runs deeper than mere existence proofs. The method of conditional expectations makes first-moment arguments constructive: if E[f(X)] >= t under a random experiment that fixes bits sequentially, then at each step, choose the bit value that keeps the conditional expectation at least t. This is a deterministic polynomial-time algorithm that achieves the probabilistic bound. More broadly, the probabilistic method provides a design paradigm: first prove that a random algorithm works (via concentration or expectation), then ask whether the proof can be derandomized. This paradigm, connecting the probabilistic method to derandomization via conditional expectations and limited independence, is one of the most productive pipelines in algorithm design.
No topics depend on this one yet.