Questions — Convolutional Neural Networks

Question 1 Multiple Choice

A CNN is trained on images of stop signs appearing in the center of frames. A new image has a stop sign in the upper-left corner. What does the CNN's convolutional architecture predict?

AThe network fails — it learned weights specific to the center position and must be retrained

BThe activation for the stop sign feature shifts to the upper-left of the feature map, because CNNs are translation equivariant

CPooling layers correct for position, producing the same output regardless of where the sign appears

DThe network detects the sign only if it was also trained on images with upper-left stop signs

Question 2 Multiple Choice

Why does weight sharing in a convolutional layer dramatically reduce the number of parameters compared to a fully connected layer processing the same input?

AConvolutional layers use simpler activation functions that require fewer computations

BThe same small filter (e.g., 3×3 weights) is applied at every spatial position, so filter parameters are not duplicated per position

CPooling layers remove most neurons before any learned weights are applied

DCNNs process each color channel independently, reducing the effective input size

Question 3 True / False

A convolutional layer is translation equivariant: moving a feature in the input produces a corresponding shift in the feature map output.

TTrue

FFalse

Question 4 True / False

Max pooling layers are what give CNNs their translation equivariance property.

TTrue

FFalse

Question 5 Short Answer

What inductive bias does a CNN encode, and why does this make it more appropriate than a fully connected network for image classification?

Think about your answer, then reveal below.

Questions: Convolutional Neural Networks