Questions: Expert Problem and Hedge Algorithm

4 questions to test your understanding

Score: 0 / 4
Question 1 Multiple Choice

You have N = 100 experts and must predict for T = 10,000 rounds. What is the optimal learning rate for Hedge, and what regret can you expect?

Aeta = 1/T = 0.0001, achieving regret O(T/ln(N)) ≈ 2,170
Beta = sqrt(ln(N)/T) = sqrt(ln(100)/10000) ≈ 0.021, achieving regret O(sqrt(T * ln N)) ≈ sqrt(10000 * 4.6) ≈ 215
Ceta = 1/sqrt(N) ≈ 0.1, achieving regret O(sqrt(N * T)) ≈ 1,000
Deta = ln(N)/T ≈ 0.00046, achieving regret O(T * sqrt(ln(N)/T)) ≈ 215
Question 2 True / False

The Hedge algorithm requires knowing the time horizon T in advance to set the optimal learning rate. Without this knowledge, the algorithm cannot achieve sublinear regret.

TTrue
FFalse
Question 3 True / False

If all N experts have identical cumulative loss after T rounds, the Hedge algorithm's regret is zero.

TTrue
FFalse
Question 4 Short Answer

Explain why the regret bound for the expert problem scales with ln(N) rather than N, and what this means for practical applicability with large expert pools.

Think about your answer, then reveal below.