Mental imagery is the capacity to represent and manipulate perceptual information in the absence of direct sensory input. Shepard and Metzler's mental rotation studies showed that response time increases linearly with the angular difference between two shapes, as if subjects are rotating an internal image — suggesting imagery shares computational processes with visual perception. The debate between depictive theories (Kosslyn: quasi-pictorial representations in a spatial medium) and propositional theories (Pylyshyn: symbolic descriptions) has driven fundamental questions about the format of mental representations.
Perform the mental rotation task on 3D figures and notice the continuous time increase with angle. Then attempt to navigate a familiar building in imagination — both tasks engage the visuospatial sketchpad and posterior cortical regions.
Mental imagery sits at the intersection of perception and cognition: it is the capacity to activate perceptual-like representations in the absence of the corresponding sensory input. When you visualize the route from your home to a coffee shop, or imagine rotating an object to see its other side, you are drawing on the same representational resources that support visual perception — but driven from within rather than from the retina. This connection to perception is not merely metaphorical. From your study of visual processing pathways, you know that early visual cortex (V1, V2) normally receives bottom-up input from the retina and top-down feedback from higher areas. Neuroimaging and TMS studies have shown that V1 is activated during mental imagery, and disrupting V1 with TMS impairs not only perception but also imagery. The brain is, in a meaningful sense, running the perceptual system "offline."
The most important experimental evidence for the analog structure of imagery comes from Shepard and Metzler's mental rotation studies. Participants were shown pairs of 3D block figures and asked whether they were the same shape presented at different orientations. Critically, reaction time increased linearly with the angular difference between the two figures, as if subjects were literally rotating one image to match the other at a fixed rotational speed. If mental imagery were purely propositional (a list of abstract features like "arm extends 90 degrees upward"), there would be no reason for rotation time to scale with angle — a lookup or inference would take the same time regardless. The continuous relationship between angle and time is the signature of analog representation: the mental image preserves the spatial structure and metrical relationships of the original object in a form that must be mentally traversed rather than simply retrieved.
The depictive vs. propositional debate crystallized around this evidence. Kosslyn argued that mental images are quasi-pictorial representations in a spatial medium — something like a display buffer in which positions and distances are meaningful in a way that mirrors physical space (image scanning studies showed that "mentally traveling" across a larger map takes longer, even when the map is imagined from memory). Pylyshyn's propositional theory countered that apparent analog behavior could be explained by tacit knowledge: people know that rotation takes time, so they produce the expected pattern as an artifact of their implicit theories about how imagery should work. The debate exposed a deep question about the format of mental representation that straightforward behavioral evidence cannot decisively resolve.
Contemporary consensus has moved toward acknowledging that imagery involves both analog spatial properties and higher-level symbolic or propositional content. Neural evidence — especially the graded retinotopic activation in early visual cortex during imagery — supports genuine spatial representation. But imagery is also clearly influenced by knowledge and expectation in ways that pure depictive accounts cannot fully accommodate: mental images are incomplete, constructive, and subject to systematic distortions. Understanding mental imagery as a controlled, partial activation of the visual processing hierarchy provides the framework for its role in analogical reasoning, problem solving, and the broader architecture of spatial cognition — topics you will build on next.