Kleene's Theorem states that the three models — DFAs, NFAs, and regular expressions — all define exactly the same class of languages (the regular languages). The theorem is proved constructively: Thompson's construction converts any regular expression to an NFA, subset construction converts NFAs to DFAs, and state elimination converts DFAs back to regular expressions. This equivalence justifies treating these three formalisms as interchangeable descriptions of regular languages.
You have already worked with two seemingly different ways to describe patterns in strings: regular expressions, which specify patterns declaratively using operators like union, concatenation, and Kleene star, and finite automata (both DFAs and NFAs), which recognize patterns by stepping through states as they read input. These formalisms look and feel very different — one is algebraic notation, the other is a state machine. Kleene's theorem establishes that they are exactly equivalent in power: any language describable by a regular expression can be recognized by a finite automaton, and any language recognized by a finite automaton can be described by a regular expression. They define the same class of languages — the regular languages.
The theorem is proved by showing three constructive conversions that form a complete cycle. Thompson's construction converts any regular expression into an equivalent NFA. The idea is compositional: base cases (single characters, empty string) become simple two-state NFAs, and the regex operators (union, concatenation, star) correspond to ways of wiring smaller NFAs together with epsilon transitions. You already know the second conversion — subset construction (also called the powerset construction) — which converts any NFA into an equivalent DFA by treating sets of NFA states as single DFA states. The third conversion, state elimination, goes from a DFA back to a regular expression by systematically removing states and labeling the remaining transitions with increasingly complex regex patterns that account for the removed paths.
The power of this equivalence is that you can freely choose whichever formalism is most convenient for the task at hand. Need to prove a language is regular? Write a regular expression — it may take one line. Need to *implement* recognition efficiently? Build a DFA — it processes each input character in constant time with no backtracking. Need to reason about nondeterministic choices? Use an NFA — its structure may be more transparent. Kleene's theorem guarantees that anything you can express in one formalism has an exact counterpart in the others, so you never lose generality by choosing the most convenient representation.
It is worth noting what the theorem does *not* say. The conversions are not always size-preserving: converting an NFA to a DFA can cause an exponential blowup in the number of states (since DFA states correspond to *subsets* of NFA states), and converting a DFA to a regular expression via state elimination can produce expressions exponentially larger than the original automaton. The equivalence is about expressive power — what languages can be described — not about efficiency of representation. Kleene's theorem also does not extend beyond finite-state models. Context-free languages, for instance, are recognized by pushdown automata but cannot in general be described by regular expressions. The equivalence holds precisely at the regular level of the Chomsky hierarchy.
No topics depend on this one yet.