Questions: Dopamine Pathways: Reward, Motivation, and Learning
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A rat's dopamine neurons are recorded while it learns that a tone reliably predicts food. After many training trials, which firing pattern is observed when the tone sounds and when food arrives?
ADopamine neurons fire strongly at food delivery throughout training, because food is pleasurable
BDopamine neurons shift their response to fire at the tone and no longer burst when food arrives, because the reward is now fully predicted
CDopamine neurons stop firing entirely once the association is learned, since updating is complete
DDopamine neurons fire equally at both the tone and food delivery, encoding the full reward sequence
This is the classic Schultz reward prediction error finding. Early in training, dopamine neurons fire at food delivery (unexpected reward). As learning proceeds, the response shifts backward to the predictive cue (the tone) — the earliest reliable signal of reward. Simultaneously, the response to food delivery disappears, because the reward is now fully predicted and the prediction error is zero. Dopamine encodes the discrepancy between expected and actual outcomes, not pleasure itself. A predicted reward produces no dopamine burst even though the hedonic experience still occurs.
Question 2 Multiple Choice
A person in addiction treatment reports intense cravings for a drug whose subjective high has become much weaker after years of use. Which aspect of dopamine neuroscience best explains this paradox?
ATheir dopamine system has been destroyed by drug use, and they crave the drug to restore normal dopamine baseline
BWanting (dopamine-mediated incentive salience) and liking (opioid-mediated hedonic pleasure) can be dissociated — the motivational system remains strongly trained toward drug cues even as hedonic response diminishes
CThe mesocortical pathway has been suppressed, eliminating the ability to find reward in other activities
DRepeated drug use causes dopamine neurons to require ever-larger doses to fire, directly causing escalating craving
The wanting/liking dissociation is the key. 'Wanting' — incentive salience, the motivational pull — is dopamine-dependent and centered in the mesolimbic pathway (VTA → nucleus accumbens). 'Liking' — hedonic pleasure — depends more on opioid and endocannabinoid systems within the nucleus accumbens. Drugs of abuse massively amplify dopamine signaling, training the mesolimbic system with exaggerated prediction errors. Drug-associated cues become powerful dopamine triggers for wanting even as tolerance reduces hedonic response. The motivational system is working exactly as designed — it has simply been trained by pharmacologically amplified signals to treat drug cues as the highest-priority stimuli.
Question 3 True / False
Dopamine neurons encode pleasure directly — they fire more when an experience is more pleasurable and less when it is less pleasurable.
TTrue
FFalse
Answer: False
This is the most pervasive misconception about dopamine. Dopamine neurons encode reward prediction errors — the discrepancy between expected and actual reward — not pleasure. A fully predicted reward produces no dopamine burst even though the hedonic experience still occurs. An unexpected reward triggers dopamine firing even if small. Actual pleasure is mediated primarily by opioid and endocannabinoid systems within the nucleus accumbens. Dopamine is the 'wanting' signal (incentive salience); the 'liking' signal (hedonic value) is a separate system. The two can be dissociated pharmacologically and behaviorally.
Question 4 True / False
The mesolimbic dopamine pathway (VTA → nucleus accumbens) and the mesocortical pathway (VTA → prefrontal cortex) originate from the same brain region but serve functionally distinct roles in reward-related behavior.
TTrue
FFalse
Answer: True
Both pathways arise from dopamine neurons in the ventral tegmental area (VTA) but project to different destinations and serve distinct functions. The mesolimbic pathway (VTA → nucleus accumbens) drives approach behavior and reinforcement learning — it stamps in 'do more of this' associations and generates the motivational pull toward rewards. The mesocortical pathway (VTA → prefrontal cortex) supports working memory, planning, and goal-directed decision-making. Disrupting each produces different clinical pictures: mesolimbic disruption impairs motivation and reinforcement; mesocortical disruption impairs executive function and planning.
Question 5 Short Answer
What is a reward prediction error (RPE), and why is it a more accurate description of what dopamine neurons encode than 'how rewarding the experience was'?
Think about your answer, then reveal below.
Model answer: A reward prediction error is the difference between the reward actually received and the reward that was expected. Dopamine neurons fire in bursts when reward exceeds expectations (positive RPE), dip below baseline when expected reward fails to arrive (negative RPE), and show no change when reward exactly matches prediction (zero RPE). This is not the same as encoding reward magnitude: a predicted large reward produces no dopamine change at delivery, while an unexpected small reward produces a burst. The RPE framework shows dopamine is a learning signal — 'update your world model in this direction and by this amount' — not a pleasure signal.
The RPE framework reveals dopamine as implementing a form of temporal difference learning — an algorithm for updating predictions based on discrepancies between expected and actual outcomes. This is why drugs that artificially flood the nucleus accumbens with dopamine are powerfully addictive: they generate large, context-free prediction errors that train the motivational system with no connection to genuinely beneficial outcomes, producing the intense, maladaptive wanting that characterizes addiction even after the hedonic response has diminished.