Questions: Expert Problem and Hedge Algorithm

4 questions to test your understanding

Score: 0 / 4

Question 1 Multiple Choice

You have N = 100 experts and must predict for T = 10,000 rounds. What is the optimal learning rate for Hedge, and what regret can you expect?

Aeta = 1/T = 0.0001, achieving regret O(T/ln(N)) ≈ 2,170

Beta = sqrt(ln(N)/T) = sqrt(ln(100)/10000) ≈ 0.021, achieving regret O(sqrt(T * ln N)) ≈ sqrt(10000 * 4.6) ≈ 215

Ceta = 1/sqrt(N) ≈ 0.1, achieving regret O(sqrt(N * T)) ≈ 1,000

Deta = ln(N)/T ≈ 0.00046, achieving regret O(T * sqrt(ln(N)/T)) ≈ 215

Question 2 True / False

The Hedge algorithm requires knowing the time horizon T in advance to set the optimal learning rate. Without this knowledge, the algorithm cannot achieve sublinear regret.

TTrue

FFalse

Question 3 True / False

If all N experts have identical cumulative loss after T rounds, the Hedge algorithm's regret is zero.

TTrue

FFalse

Question 4 Short Answer

Explain why the regret bound for the expert problem scales with ln(N) rather than N, and what this means for practical applicability with large expert pools.

Think about your answer, then reveal below.