Merge is the core generative operation in minimalist syntax: it combines two linguistic objects into a new object. Unlike X-bar theory's phrase structure rules, merge is a binary, recursive operation that generates all syntactic structures without category-specific rules. It provides the foundation for understanding how the finite cognitive system generates infinite linguistic expressions.
Start with simple binary trees (VP, NP structures) and trace how merge builds them recursively. Then examine how merge differs from traditional phrase structure rules by its uniform, category-independent nature.
In X-bar theory — your hard prerequisite — phrase structure was described by a set of rules: VP → V NP, NP → Det N, and so on. Each rule was category-specific, telling you exactly what constituents could combine with what. This worked descriptively, but it raised a pressing question: why are there so many rules, and why do they all share the same basic shape (head followed by complements, then specifiers)? The minimalist program, which you've studied through its core concepts, answers by collapsing all of those rules into a single operation: Merge.
Merge is radically simple. It takes two syntactic objects — call them α and β — and combines them into a new set {α, β}. That's the entire definition. No category labels, no ordering specification, no language-specific parameters. The same operation that builds a verb phrase, a noun phrase, and a clause is identical in each case. What changes is *what* you merge, not the operation itself. This simplicity is theoretically attractive because it reduces the machinery the human language faculty needs to maintain: instead of dozens of phrase structure rules, the grammar contains exactly one combinatorial operation.
The recursive nature of Merge is where the real power lies. When you merge two words, you get a phrase. When you merge that phrase with another element, you get a larger phrase. That larger phrase can merge again, and again, with no principled upper limit. This recursive self-embedding — sentences within sentences, clauses within clauses — is one of the most distinctive properties of human language, and Merge derives it automatically. A child who has acquired Merge has acquired infinite expressive capacity, because any syntactic object can serve as an input to a new Merge operation.
External Merge and Internal Merge are the two applications of the operation you need to distinguish. External Merge combines two previously independent objects — this builds basic phrase structure. Internal Merge takes an element already inside a structure and merges it with the root of that structure, creating a copy. This is the minimalist reanalysis of movement: phenomena like question formation (where a wh-word appears at the front of the sentence) or passivization are not the result of a separate "Move" operation, but of Merge applying to an object that is already part of the structure. By unifying structure-building and movement under a single operation, the minimalist program achieves a powerful theoretical compression — two phenomena that appeared distinct in X-bar theory turn out to be instances of the same underlying computation, differing only in whether Merge reaches into the existing structure or brings in entirely new material.