Questions — Semantic Segmentation

Question 1 Multiple Choice

A researcher tries to repurpose a standard CNN image classifier for semantic segmentation by attaching a softmax layer that outputs class probabilities independently for each pixel. What is the fundamental problem with this approach?

ACNNs cannot process images with more than three channels, making pixel-level output impossible

BProgressive pooling and striding reduce spatial resolution so severely that per-pixel localization is lost by the final layers

CSoftmax normalization across all pixels forces the model to assign each class to exactly one region

DThe classification loss function is incompatible with pixel-level supervision

Question 2 Multiple Choice

A semantic segmentation model produces accurate class predictions but jagged, imprecise boundaries around objects. Which architectural modification would most directly address this?

AAdding more pooling layers to increase the semantic richness of features

BReplacing dilated convolutions with standard convolutions to reduce receptive field size

CAdding skip connections that forward high-resolution feature maps from early encoder layers to the decoder

DIncreasing the number of output classes to capture finer boundary categories

Question 3 True / False

Dilated (atrous) convolutions expand the receptive field by adding more learnable parameters to the convolutional kernel.

TTrue

FFalse

Question 4 True / False

Skip connections in encoder-decoder segmentation models (such as U-Net) allow the decoder to recover fine spatial details that are progressively lost during encoding.

TTrue

FFalse

Question 5 Short Answer

Explain the fundamental tension in semantic segmentation between spatial resolution and semantic richness, and describe how encoder-decoder architectures resolve it.

Think about your answer, then reveal below.

Questions: Semantic Segmentation