Questions: Ventral Stream and Visual Object Recognition

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A neuron in inferotemporal (IT) cortex responds vigorously when shown a coffee mug viewed from the side. The mug is then rotated so it is viewed from above — a completely different pattern of pixels and edges. How would a typical IT neuron most likely respond?

AIt would not respond — IT neurons are tuned to the specific pixel patterns of trained views
BIt would respond vigorously — IT neurons are largely invariant to changes in viewing angle
CIt would respond only if the rotation was gradual enough for the neuron to track the transformation
DIt would respond only if the mug was also at the same retinal location as the original view
Question 2 Multiple Choice

A patient with damage to the fusiform face area (FFA) struggles to identify familiar faces from photographs. According to the neuroscience of category-selective regions, which additional finding would you most expect?

AComplete inability to recognize any face, with perfectly normal recognition of all other objects
BDifficulty with fine-grained discrimination of other visually similar categories, especially in domains where the patient has expertise
CNormal face recognition in person but impaired performance on photographs, because the FFA requires 3D depth information
DSeverely impaired recognition of all visual objects across all categories equally
Question 3 True / False

In the ventral visual stream, neurons in earlier areas (V1, V2) have smaller receptive fields and respond to simpler features, while neurons in inferotemporal cortex have larger receptive fields and respond to complex shapes and objects.

TTrue
FFalse
Question 4 True / False

Category-selective regions like the fusiform face area (FFA) respond exclusively to their preferred category — a face-selective neuron produces no response at most to non-face objects.

TTrue
FFalse
Question 5 Short Answer

What computational problem does the ventral stream solve with perceptual invariance, and what is the key evidence that this is achieved through a hierarchical process rather than a single transformation?

Think about your answer, then reveal below.