A nondeterministic finite automaton (NFA) extends the DFA by allowing transitions to zero, one, or multiple states on the same input symbol, as well as ε-transitions that consume no input. An NFA accepts a string if at least one possible computation path ends in an accept state. Nondeterminism is a mathematical convenience, not a physical model — every NFA can be converted to an equivalent DFA, so NFAs recognize the same class of languages. NFAs are often far smaller and easier to construct than equivalent DFAs.
Build NFAs for union and concatenation of two simpler languages to see why nondeterminism is natural for language operations. Then trace the parallel-execution intuition: imagine the NFA forking into multiple copies at each nondeterministic choice, accepting if any copy accepts.
You already know that a deterministic finite automaton processes input by following exactly one transition from each state on each symbol. A nondeterministic finite automaton (NFA) relaxes this constraint in two ways: a state may have zero, one, or many transitions on the same input symbol, and it may have ε-transitions — arrows that the machine can follow without consuming any input at all. Where a DFA walks a single path through its state diagram, an NFA can branch into many paths simultaneously.
The key to understanding NFA acceptance is the "exists" quantifier. An NFA accepts a string if at least one computation path through the branching possibilities ends in an accept state — even if every other path dies or rejects. Think of it like exploring a maze by cloning yourself at every fork: if any clone reaches the exit, you succeed. This is fundamentally different from requiring all paths to accept, which would give you a different (and less useful) computational model.
This branching power makes NFAs remarkably convenient for building automata compositionally. Suppose you have a DFA for language A and a DFA for language B, and you want an automaton for A ∪ B. With DFAs, you need a complex product construction. With NFAs, you simply add a new start state with ε-transitions to the start states of both machines — the nondeterminism lets the machine "guess" which language the input belongs to and verify that guess along one path. The same trick works for concatenation and Kleene star, which is why NFAs are the natural intermediate representation when converting regular expressions to automata.
Despite their apparent extra power, NFAs recognize exactly the same class of languages as DFAs — the regular languages. Every NFA can be converted to an equivalent DFA through the subset construction, where each DFA state represents a set of NFA states that could be active simultaneously. The tradeoff is size: an NFA with n states can require up to 2ⁿ DFA states in the worst case. This exponential blowup is why NFAs matter in practice — they can be exponentially more compact than their DFA equivalents, which is crucial for applications like compiler lexical analysis and regular expression engines.