Questions — Recurrent Neural Networks

Question 1 Multiple Choice

An RNN is trained on sequences of length 20, but at inference time it needs to process sequences of length 100. Why can a basic RNN architecture handle this without modification, unlike a standard feedforward network?

ARNNs automatically resize their weight matrices to match sequence length at inference time

BRNNs use weight sharing — the same weight matrices process every time step — so the architecture is independent of sequence length

CRNNs store all sequence elements in a fixed-size lookup table, allowing variable input sizes

DRNNs cannot handle sequences longer than those seen during training; the question assumes a capability RNNs lack

Question 2 Multiple Choice

During backpropagation through time on a 50-step sequence, the gradient of the loss with respect to the initial hidden state involves a chain rule product of 50 Jacobian matrices. What is the most likely problem this creates, and why?

AMemory overflow, because storing 50 intermediate hidden states requires too much RAM

BVanishing gradients: if the weight matrix has eigenvalues less than 1, repeated multiplication drives gradient magnitudes exponentially toward zero, preventing learning of long-range dependencies

CComputational expense is the primary issue, not gradient flow — the math works correctly but slowly

DVanishing gradients only affect the output layer; internal layers receive normal gradient signals

Question 3 True / False

RNNs can theoretically learn to depend on any arbitrarily distant past input in a sequence because the hidden state carries most prior information forward indefinitely.

TTrue

FFalse

Question 4 True / False

Gradient clipping is a complete solution to the gradient instability problem in RNNs because it prevents both vanishing and exploding gradients.

TTrue

FFalse

Question 5 Short Answer

Explain why an RNN's hidden state is both its greatest strength and the source of its main training challenge.

Think about your answer, then reveal below.

Questions: Recurrent Neural Networks