Dopamine neurons in the ventral tegmental area and substantia nigra encode reward prediction errors and motivation. The mesolimbic pathway (VTA to nucleus accumbens) drives reward-seeking; the mesocortical pathway (VTA to prefrontal cortex) supports goal-directed decision-making. Dopamine doesn't directly code pleasure but rather incentive salience—the 'wanting' for rewarding stimuli. This system is hijacked by drugs of abuse, creating addiction.
You already know that dopamine is a neuromodulator released in reward contexts. The key insight here is *what* dopamine actually encodes — not pleasure itself, but the discrepancy between what you expected and what you got. This is the reward prediction error (RPE). When something good happens unexpectedly, dopamine neurons in the ventral tegmental area (VTA) fire in a burst. When something expected to be rewarding fails to materialize, dopamine dips below baseline. If everything goes exactly as predicted, dopamine does not change at all. This is not the language of pleasure — it is the language of learning: "update your model of the world."
Two distinct pathways carry this signal to different destinations. The mesolimbic pathway runs from VTA to the nucleus accumbens (NAc) in the basal forebrain — the core reward circuit. Dopamine release here drives approach behavior and reinforcement learning, essentially stamping in the association "do more of what led here." The mesocortical pathway runs from VTA to prefrontal cortex, supporting working memory, planning, and goal-directed decision-making. Think of mesolimbic as the accelerator — generating the pull toward a reward — and mesocortical as the steering wheel, directing which goals get prioritized and how to pursue them.
A crucial conceptual distinction separates "wanting" from "liking." Incentive salience — the motivational pull toward a stimulus — is dopamine-dependent. But hedonic pleasure (the actual experience of enjoying a reward) depends more on opioid and endocannabinoid systems within the NAc. These can be dissociated: dopamine depletion reduces wanting without eliminating liking, and dopamine flooding increases wanting without proportionally increasing liking. This distinction explains a puzzling feature of addiction: people report intense craving ("wanting") for a drug whose subjective pleasure has diminished substantially with repeated use. The motivational system is intact and pointing urgently — just at the wrong target.
The addiction hijacking follows directly from the RPE framework. Drugs like cocaine and amphetamines flood the NAc with dopamine, producing a prediction error signal far larger than any natural reward. The system learns with exaggerated force, and over time, even drug-associated cues — sights, sounds, contexts — begin triggering dopamine release before any drug is taken. This is cue-induced craving: the anticipatory wanting that makes relapse so persistent even after extended abstinence. The dopamine system is not broken; it is working exactly as designed — it has simply been trained by pharmacologically amplified prediction errors to treat drug-related cues as the most important stimuli in the environment.