Questions — Loss Functions and Objective Functions

Question 1 Multiple Choice

A binary classifier is trained with cross-entropy loss. After 20 epochs, training loss has dropped from 0.9 to 0.3, but training accuracy has stayed at 85% for the last 15 epochs. Which of the following best explains this pattern?

AThe model has overfit: the loss decrease is spurious because the model memorized training labels

BThe model is becoming better calibrated — its probability estimates are growing more confident and accurate — without changing which class it predicts as most likely; loss and accuracy measure different things

CThere is a bug in the loss calculation; accuracy and loss should always move together during training

DThe learning rate is too high, causing loss to decrease while accuracy oscillates around the same value

Question 2 Multiple Choice

A team is building a model to predict house prices. They consider MSE and Huber loss. Their dataset contains a few extreme outliers — houses sold at ten times the typical price due to unusual circumstances. Why might Huber loss be preferable to MSE here?

AHuber loss ignores all errors below a threshold delta, so outliers that fall below the threshold do not affect training

BMSE squares large errors, so the extreme outliers generate enormous gradients that dominate the weight updates and pull the model toward fitting the outliers; Huber loss caps large-error gradients (acting like MAE above delta), limiting the influence of outliers while preserving smooth MSE-like gradients near the minimum

CHuber loss automatically removes outliers from the training batch before computing gradients

DMSE is unbounded, so training diverges when outliers are present; Huber loss ensures convergence by capping total loss

Question 3 True / False

The loss function determines what the model learns to optimize during training, while accuracy and other evaluation metrics capture what you actually care about — and these two can diverge.

TTrue

FFalse

Question 4 True / False

Mean squared error is a good default loss function for binary classification because it directly penalizes wrong class predictions and is simpler to implement than cross-entropy.

TTrue

FFalse

Question 5 Short Answer

Explain why the choice of loss function is a design decision about model behavior, not just a technical implementation detail. Give an example where two loss functions would train models that behave differently even with identical architectures and data.

Think about your answer, then reveal below.

Questions: Loss Functions and Objective Functions