Questions — Advanced Ensemble Methods

Question 1 Multiple Choice

An engineer builds an ensemble by training 100 decision trees on the exact same training data with no randomization, then averages their predictions. She expects significant performance gains over a single tree. What is the flaw in her reasoning?

AMore trees always improve performance regardless of how they are trained; the approach is valid

BWithout diversity, all trees make the same errors, so averaging them produces the same wrong answer more confidently rather than canceling errors

CEnsembles only work with fewer than 10 base models; 100 trees creates too much variance

DThe ensemble will improve bias but is guaranteed to increase variance, worsening overall performance

Question 2 Multiple Choice

A boosted model achieves near-perfect training accuracy after 500 boosting rounds but performs much worse on the test set. The most likely explanation is:

ABoosting sequentially corrects errors, and after enough rounds it can fit noise in the training data, leading to overfitting

BBagging was inadvertently applied instead of boosting, causing the base learners to underfit

CThe base learners were too diverse, causing their corrections to cancel each other out

DBoosting only reduces variance, not bias, so it cannot explain training accuracy improvements

Question 3 True / False

Bagging primarily reduces variance by training multiple models on different random subsets of the training data and averaging their predictions, which cancels out uncorrelated errors.

TTrue

FFalse

Question 4 True / False

Because boosting trains models sequentially, each one explicitly correcting the previous ensemble's errors, it is inherently more resistant to overfitting than bagging.

TTrue

FFalse

Question 5 Short Answer

Why is diversity among base learners the fundamental requirement for ensemble methods to work? What happens when diversity is absent?

Think about your answer, then reveal below.

Questions: Advanced Ensemble Methods