Questions — Hyperparameter Optimization

Question 1 Multiple Choice

A model has 6 hyperparameters, but only learning rate and batch size meaningfully affect performance. A researcher runs 50 evaluations. Compared to grid search over all 6 parameters, random search would most likely:

APerform worse because it does not evaluate every combination systematically

BPerform comparably because both methods sample the same number of configurations

CFind better learning rate and batch size values because it explores more distinct values of those important dimensions per evaluation

DOnly outperform grid search for deep learning models, not other model types

Question 2 Multiple Choice

What distinguishes Bayesian optimization from both grid and random search in how it selects configurations to evaluate?

AIt evaluates every combination in the hyperparameter space exhaustively before reporting results

BIt samples configurations randomly but then applies a filter to remove obviously bad ones

CIt builds a probabilistic surrogate model of the performance landscape and uses an acquisition function to direct evaluations toward promising regions

DIt fixes the least important hyperparameters first and then exhaustively searches the remaining ones

Question 3 True / False

Random search is almost seldom better than grid search for hyperparameter optimization because grid search is exhaustive and therefore expected to find the optimal combination.

TTrue

FFalse

Question 4 True / False

Bayesian optimization uses an acquisition function to balance exploring uncertain regions of the hyperparameter space against exploiting regions already known to perform well.

TTrue

FFalse

Question 5 Short Answer

Why does Bayesian optimization typically require fewer training runs than random search to find a high-performing hyperparameter configuration, and when is this advantage most valuable?

Think about your answer, then reveal below.

Questions: Hyperparameter Optimization