The orbitofrontal cortex (OFC) represents subjective value across different decision contexts and rewarding stimuli. OFC neurons encode the expected value of actions and outcomes, with this valuation updated based on experience and changing task demands. OFC damage impairs decision-making and reversal learning, causing perseveration on choices that are no longer rewarded, indicating its necessity for adaptive decision-making.
From your study of dopamine and the reward system, you know that midbrain dopamine neurons encode reward prediction errors — the difference between what was expected and what actually occurred — and that this signal is broadcast to the striatum and prefrontal cortex to update action values. From neuroeconomics, you know that decision-making involves computing the subjective expected value of options, weighing probability by utility. The orbitofrontal cortex is the region that sits at the convergence of these systems: it transforms sensory information about potential rewards into the common-currency value representations that guide choice.
The OFC occupies the ventral surface of the prefrontal cortex, directly above the orbital bones. Its anatomical position reflects its functional role: it receives dense input from all sensory modalities (taste, smell, visual, somatosensory), from the amygdala, and from the striatal reward circuit, while projecting back to the striatum, hypothalamus, and brainstem. This convergence makes the OFC ideally positioned to integrate the hedonic properties of a stimulus (how pleasant is it?) with its current motivational salience (how much do I want it right now, given my current state?). OFC neurons have been recorded firing in response to reward cues, reward delivery, and — critically — reward omission, in a pattern consistent with encoding expected value that is continuously updated by experience.
The decisive evidence for OFC's role in flexible value updating comes from reversal learning paradigms. In a typical experiment, an animal or human learns that stimulus A yields reward and stimulus B does not; then the contingencies are reversed, so B now yields reward and A does not. The healthy system quickly detects the reversal and shifts behavior. OFC-lesioned subjects fail to make this shift: they continue choosing the previously rewarded stimulus, a pattern called perseveration. The deficit is not a failure of perception or movement — they can detect that outcomes have changed — but a failure to update the value representation that drives choice. The now-unrewarded option retains its old value tag, and the subject acts on that stale information. This is why OFC damage is so disabling: in a world where reward contingencies change constantly (relationships, workplaces, food sources), inability to revise learned values is catastrophic for adaptive behavior.
The Iowa Gambling Task — a classic neuropsychological paradigm where subjects draw from four decks with different long-run reward structures — consistently reveals OFC impairment in patients with ventromedial PFC damage (the human analog of OFC in animal work), as well as in individuals with addiction and psychopathy. These populations share the behavioral signature of continuing to choose options that, by any rational accounting, are net negative. The mechanistic explanation is OFC dysfunction: the regions responsible for encoding and updating learned value fail to transmit the "this isn't working, update your representation" signal that healthy decision-making requires. Understanding the OFC thus connects the reward circuitry you already know to a specific higher-level computational function — real-time, experience-dependent value assignment — that is distinct from both the striatum's action-selection role and the dlPFC's working-memory-based planning.
No topics depend on this one yet.