Prosodic structure is formally organized into hierarchical levels (mora, syllable, foot, word, phrase, utterance) with constraints on each level. Stress assignment rules and metrical grids provide formal accounts of rhythm and prominence patterns across languages.
From your study of metrical phonology and formal phonotactics, you know that stress is not random — it is governed by principles that operate on the internal structure of words, and that phonological systems obey constraints on which sound sequences are permitted. Prosodic structure formalism provides the overarching framework that unifies these insights: it proposes that phonological representations are organized into a strict prosodic hierarchy, where each level of structure is built from units of the level immediately below.
The hierarchy runs from smallest to largest: mora → syllable → foot → prosodic word → phonological phrase → intonational phrase → utterance. Each level has its own well-formedness constraints. At the mora level, languages distinguish between light syllables (one mora, typically V or CV) and heavy syllables (two morae, typically CVC or CVV) — a distinction that governs stress assignment in quantity-sensitive languages. At the foot level, languages organize syllables into rhythmic units; the most common foot types are the trochee (strong-weak: *TA-ble*) and the iamb (weak-strong: *a-LONE*), and individual languages typically prefer one type consistently. Stress in a word then falls on the prominent syllable of the head foot.
Metrical grids provide a formal notation for representing relative prominence across these levels. Each syllable occupies a column of grid marks, with more grid marks indicating greater prominence. A syllable that is stressed at the word level has marks at both the syllable row and the word row; the most prominent syllable in a phrase adds a mark at the phrase row. The grid makes visible the eurhythmy effects that occur when languages avoid adjacent prominent beats (stress clash) or adjacent non-prominent beats (stress lapse) — phenomena that drive stress shift (the "rhythm rule") and elucidate why sentences like *thirteen MEN* become *THIRteen men* when a stressed syllable follows.
The formal power of this framework is that the same hierarchical architecture accounts for phenomena at multiple levels simultaneously. Prosodic phrasing — how utterances break into chunks at the phrase and intonational-phrase levels — is governed by syntactic and semantic factors interacting with prosodic constraints, and these interactions produce the pausing and pitch-reset patterns that listeners interpret as clause and phrase boundaries. Clitic attachment (whether function words lean rhythmically on adjacent content words) is determined by the prosodic word boundary. By treating prosodic structure as a hierarchy with level-specific constraints, the formalism generates precise, falsifiable predictions about which strings are well-formed, how stress shifts under rhythmic pressure, and how phrase boundaries align with syntactic structure — providing the analytical vocabulary for rigorous cross-linguistic comparison.