A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Parsing Preferences and Computational Complexity

Research Depth 77 in the knowledge graph ☐ I know this ☆ Set as goal

170topics build on this

411prerequisites beneath it

Introduction to Psycholinguistics Computational Parsing Algorithms and Complexity +1 more→→Syntactic Parsing Algorithms and Models

Core Idea

Parsers exhibit biases toward simpler structures and frequent constructions. Minimal attachment (attaching phrases as low as possible) and late closure (attaching new material to recent constituents) govern initial parsing. Working memory limitations affect how many open dependencies can be maintained; high dependencies (long-distance relative clauses) are harder than low dependencies. Parsing preferences interact with grammatical constraints and input frequency to determine comprehension difficulty.

How It's Best Learned

Predict parsing preferences for ambiguous sentences and test with reading-time experiments. Manipulate complexity factors (embedding depth, number of dependencies) and measure comprehension difficulty.

Common Misconceptions

Parsing preferences are not hard constraints; they bias initial analysis but can be overcome by strong cues.
Working memory limitations are not absolute; they vary by individual and interact with linguistic factors.

Explainer

From your psycholinguistics background and work with garden-path sentences, you know that parsing is not a neutral process of recovering structure — the parser has preferences, and those preferences create predictable patterns of difficulty. The study of parsing preferences and computational complexity maps those patterns systematically: why are some sentences easy to understand even when they are long, while others are difficult even when they are short?

The two most studied parsing preferences are minimal attachment and late closure. Minimal attachment means the parser attaches incoming material using the fewest additional syntactic nodes possible — preferring to extend an existing phrase rather than open a new one. Late closure means the parser prefers to attach new words to the most recently opened syntactic constituent rather than closing it and opening a new phrase. These are both efficiency heuristics: they minimize the structural complexity the parser must track at any moment. The preferences usually produce correct results, but when they lead to the wrong analysis (as in garden paths), the parser must revise — and the cost of revision is what makes complexity measurable.

Working memory is the resource constraint that underlies many complexity effects. Parsing requires simultaneously maintaining an incomplete structure in memory while integrating new words into it. The key variable is dependency distance: how far apart are the words that must be linked for the sentence to be understood? In a simple sentence (*The cat chased the mouse*), the verb and its arguments are close together. In a center-embedded clause (*The reporter that the senator that the lobbyist attacked accused ran*), the main verb *ran* is separated from its subject *reporter* by two intervening clauses — the dependency must be held open across multiple intervening words. Sentences with long, overlapping dependencies are dramatically harder to process than sentences where dependencies are short and resolved quickly, even when both are grammatical.

This explains an otherwise puzzling asymmetry: subject relative clauses (*The reporter who attacked the senator*) are consistently easier than object relative clauses (*The reporter that the senator attacked*), even though both are grammatical. In the subject relative, the relativized element (*reporter*) is in the same position it would occupy in a simple sentence (subject). In the object relative, the relativized element is in the object position while the subject of the relative clause intervenes — creating a longer dependency and a less frequent structural pattern. Processing difficulty tracks both dependency length (how long a gap must be held in memory) and frequency (how often this structure type appears in input). High-frequency structures are faster because the parser has acquired stronger expectations for them. This interaction between memory constraints and input statistics makes parsing complexity a window into both the architecture of the parsing system and the statistical structure of the language learner's environment.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Introduction to Exponents → Order of Operations → Integer Order of Operations → Variable Expressions → The Distributive Property → Variables and Expressions Review → Introduction to Polynomials → Adding and Subtracting Polynomials → Multiplying Polynomials → Factorial → Permutations → Combinations → Counting Principles: Addition and Multiplication Rules → Introduction to Graph Theory → Propositional Logic Foundations → Logical Equivalences → Set Operations: Union, Intersection, and Complement → Cartesian Products and Relations → Partial Orders → Binary Relations → Equivalence Relations → Injective, Surjective, and Bijective Functions → Lambda Calculus → Lambda Calculus for Linguistic Semantics → Computational Parsing Algorithms and Complexity → Parsing Preferences and Computational Complexity

Longest path: 78 steps · 411 total prerequisite topics

Prerequisites (3)

Introduction to Psycholinguisticshard Sentence Parsing and Garden-Path Sentencessoft Computational Parsing Algorithms and Complexitysoft

Leads To (1)

Syntactic Parsing Algorithms and Modelssoft