Questions: Semantic Segmentation

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A researcher tries to repurpose a standard CNN image classifier for semantic segmentation by attaching a softmax layer that outputs class probabilities independently for each pixel. What is the fundamental problem with this approach?

ACNNs cannot process images with more than three channels, making pixel-level output impossible
BProgressive pooling and striding reduce spatial resolution so severely that per-pixel localization is lost by the final layers
CSoftmax normalization across all pixels forces the model to assign each class to exactly one region
DThe classification loss function is incompatible with pixel-level supervision
Question 2 Multiple Choice

A semantic segmentation model produces accurate class predictions but jagged, imprecise boundaries around objects. Which architectural modification would most directly address this?

AAdding more pooling layers to increase the semantic richness of features
BReplacing dilated convolutions with standard convolutions to reduce receptive field size
CAdding skip connections that forward high-resolution feature maps from early encoder layers to the decoder
DIncreasing the number of output classes to capture finer boundary categories
Question 3 True / False

Dilated (atrous) convolutions expand the receptive field by adding more learnable parameters to the convolutional kernel.

TTrue
FFalse
Question 4 True / False

Skip connections in encoder-decoder segmentation models (such as U-Net) allow the decoder to recover fine spatial details that are progressively lost during encoding.

TTrue
FFalse
Question 5 Short Answer

Explain the fundamental tension in semantic segmentation between spatial resolution and semantic richness, and describe how encoder-decoder architectures resolve it.

Think about your answer, then reveal below.