Face perception engages specialized regions within the ventral visual stream, particularly the fusiform face area (FFA) in fusiform gyrus, the occipital face area (OFA), and regions in lateral prefrontal and superior temporal cortex. These regions represent facial identity, expression, gaze direction, and biological motion. Some evidence suggests these regions are innately specialized for faces, while other evidence supports view-invariant learning mechanisms.
From your study of the ventral visual stream, you know that visual processing moves from early feature detection in V1 through increasingly complex object representations as information travels ventrally and anteriorly through temporal cortex. By the time signals reach inferotemporal cortex, neurons respond to whole objects regardless of viewing angle — the substrate for visual object recognition. Faces are objects, so face recognition should fit naturally into this framework. And in many ways it does — but there is compelling evidence that faces recruit a partially specialized system within and around this stream, and understanding why reveals something important about how the brain organizes high-level vision.
The core region is the fusiform face area (FFA), a patch of cortex in right fusiform gyrus that responds dramatically more to faces than to other visual categories. The FFA was identified through fMRI studies by Nancy Kanwisher and colleagues in the 1990s: when participants viewed faces, a consistent region of right fusiform gyrus showed reliable signal increases that did not occur for houses, objects, or scrambled images. The FFA shows sensitivity to facial identity, not just face-category detection — it responds differently to different individuals' faces. Crucially, damage to right fusiform (prosopagnosia) causes selective impairment in recognizing familiar faces, including one's own family, while object recognition and basic visual perception remain largely intact. The dissociation — faces damaged, objects spared — is strong evidence for regional specialization.
The FFA is not the only face-selective region. The occipital face area (OFA), in inferior occipital gyrus, sits earlier in the processing hierarchy and may provide structural inputs to the FFA. The posterior superior temporal sulcus (pSTS) responds to dynamic aspects of faces — eye gaze direction, mouth movements, expressions, and the biological motion that signals another person's intentions. This distinction is captured in the Haxby-Hoffman-Gobbini model: a "core" system (FFA, OFA, pSTS) handles the visual analysis of faces, and an "extended" system (amygdala for emotional relevance, prefrontal cortex for intentional attribution) extracts social meaning from that visual analysis. Face perception is a cascade from structural encoding to social inference, not a single process.
There is longstanding debate about *why* faces are treated specially. The domain-specificity view holds that faces are evolutionarily significant stimuli that required and produced dedicated neural machinery. Supporting evidence includes the face inversion effect (inverting a face dramatically impairs recognition far more than inverting other objects), holistic processing (the parts of a face are processed as an integrated whole, not as independent features — demonstrated by the composite face effect), and the other-race effect (faces from racial groups one has less experience with are recognized less accurately and processed less holistically). The expertise view counters that the FFA reflects fine-grained individuation of any homogeneous category, not faces specifically — expert bird-watchers show FFA activity for bird images. Both views have empirical support, and the most likely resolution is that the fusiform region is specialized for fine-grained individuation within categories requiring it, and faces are the category that universally receives intense individuation experience. The debate is a productive reminder that the brain's functional organization reflects both innate structure and developmental experience.