A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Spatial Audio and Ambisonics

Research Depth 81 in the knowledge graph ☐ I know this ☆ Set as goal

346prerequisites beneath it

Core Idea

Spatial audio describes techniques for reproducing sound in three-dimensional space, going beyond the left-right stereo field to include height and depth dimensions. Where conventional stereo uses two channels to create a horizontal image, spatial audio formats encode the full sphere of possible sound positions — above, below, front, back, and all angles between.

Ambisonics is a full-sphere surround sound technique developed by Michael Gerzon in the 1970s. Rather than recording individual speaker feeds, Ambisonics encodes the soundfield as a set of mathematical components (B-format signals) that describe the acoustic pressure and directional velocity at a single point in space. First-order Ambisonics (FOA) uses four channels (W, X, Y, Z — omnidirectional pressure plus three directional components). Higher-order Ambisonics (HOA) uses additional channels to encode finer spatial detail: second-order uses nine channels, third-order uses sixteen. The advantage of Ambisonics is format agnosticism — a single recorded or mixed B-format file can be decoded for any speaker array (stereo, quad, 5.1, 7.1.4, headphones with HRTF) at playback time.

HRTF (Head-Related Transfer Function) is the acoustic transformation applied to sound as it diffracts around the head and pinnae (outer ears) before reaching the eardrums. HRTFs encode the cues that allow localization of sounds in three dimensions from just two ears. Convolving audio with HRTF filters produces binaural audio — headphone playback that creates convincing height and out-of-head sound placement. Personalized HRTFs (measured from a specific individual) produce more accurate localization than generic averages, which is why platforms like Apple offer HRTF personalization through iPhone scanning.

Object-based audio formats (Dolby Atmos, Sony 360 Reality Audio, MPEG-H) encode audio as individual sound objects with positional metadata rather than fixed speaker channel assignments. At playback, the renderer maps each object to the available speakers or headphones using the HRTF or speaker feed calculation appropriate to the playback configuration. This format flexibility is why a Dolby Atmos mix can play on a 7.1.4 cinema system, a 5.1.2 home theater, or headphones — the same mix data is decoded differently for each context.

Explainer

Spatial audio represents the leading edge of consumer and professional audio technology. Apple's Spatial Audio (headphone Atmos with dynamic head tracking), Sony's 360 Reality Audio, and Amazon Music's Spatial Audio catalog are driving mainstream adoption of binaural and object-based formats. Simultaneously, immersive audio for VR, AR, and spatial computing demands accurate, interactive three-dimensional audio with head-tracked HRTF rendering.

The technical demands of spatial audio production are substantially higher than stereo. Mixing in Atmos requires atmos-capable software (Pro Tools + Dolby Atmos Production Suite, Logic Pro's Spatial Audio mixer, Nuendo) and a speaker array (at minimum a 7.1.4 bed) for monitoring. Evaluating object placement and height imaging requires both speaker monitoring and binaural headphone checking, as the listening experience differs significantly between systems.

Ambisonics has become particularly important for VR and 360-degree video, where the listener's head orientation changes dynamically. A recorded Ambisonic soundfield can be rotated in real time to match head tracking, maintaining correct spatial correspondence between visual and auditory scenes. This interactivity distinguishes spatial audio production for immersive media from traditional post-production workflows, requiring new tools, new monitoring approaches, and a fundamentally different way of thinking about the relationship between sound and space.

What did you take from this?

Topics in reflective domains aren't scored by quiz answers. Read, reflect, and mark when you've thought it through.

Quiz me anyway →

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Literal Equations → Slope-Intercept Form → Point-Slope Form → Writing Linear Equations → Parallel and Perpendicular Line Slopes → Graphing Linear Equations → Piecewise Functions → Step Functions → Composition of Functions → Inverse Functions → Radical Functions and Graphs → Rational Exponents → Exponential Functions and Graphs → Logarithms Introduction → Pitch and Frequency → Digital Audio Fundamentals → Sampling Theory in Audio → Analog-to-Digital Conversion in Audio → Audio Signal Chain Architecture → Reverb and Spatial Effects → Spatial Audio and Ambisonics

Longest path: 82 steps · 346 total prerequisite topics

Prerequisites (1)

Reverb and Spatial Effectshard

Leads To (0)

No topics depend on this one yet.