Questions: Hyperparameter Optimization

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A model has 6 hyperparameters, but only learning rate and batch size meaningfully affect performance. A researcher runs 50 evaluations. Compared to grid search over all 6 parameters, random search would most likely:

APerform worse because it does not evaluate every combination systematically
BPerform comparably because both methods sample the same number of configurations
CFind better learning rate and batch size values because it explores more distinct values of those important dimensions per evaluation
DOnly outperform grid search for deep learning models, not other model types
Question 2 Multiple Choice

What distinguishes Bayesian optimization from both grid and random search in how it selects configurations to evaluate?

AIt evaluates every combination in the hyperparameter space exhaustively before reporting results
BIt samples configurations randomly but then applies a filter to remove obviously bad ones
CIt builds a probabilistic surrogate model of the performance landscape and uses an acquisition function to direct evaluations toward promising regions
DIt fixes the least important hyperparameters first and then exhaustively searches the remaining ones
Question 3 True / False

Random search is almost seldom better than grid search for hyperparameter optimization because grid search is exhaustive and therefore expected to find the optimal combination.

TTrue
FFalse
Question 4 True / False

Bayesian optimization uses an acquisition function to balance exploring uncertain regions of the hyperparameter space against exploiting regions already known to perform well.

TTrue
FFalse
Question 5 Short Answer

Why does Bayesian optimization typically require fewer training runs than random search to find a high-performing hyperparameter configuration, and when is this advantage most valuable?

Think about your answer, then reveal below.