← Graph View All Domains

A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Multiplicative Weights Method

Research Depth 99 in the knowledge graph ☐ I know this ☆ Set as goal

1topic build on this

688prerequisites beneath it

See this on the map →

Online Learning and Regret Bounds Expected Value→→Expert Problem and Hedge Algorithm

Core Idea

The multiplicative weights (MW) method is a meta-algorithm for online decision-making where, in each round, the learner maintains a weight for each option and selects (or randomizes) proportionally to these weights. After observing the loss, weights of poorly performing options are multiplicatively decreased: w_i <- w_i * (1 - eta * loss_i). This achieves O(sqrt(T * ln N)) regret over T rounds with N options. The method is a universal primitive that appears independently across computer science — as the Hedge algorithm in online learning, the Winnow algorithm in machine learning, boosting in ensemble methods, and equilibrium computation in game theory.

Explainer

The multiplicative weights method is one of the most versatile algorithmic primitives in theoretical computer science. Its core idea is simple: maintain a weight for each option, use the weights to make decisions, then update by multiplicatively penalizing options that performed poorly. Despite this simplicity, the method achieves near-optimal regret bounds and appears as a key ingredient in algorithms across diverse fields.

The algorithm proceeds as follows. Initialize weights w_i = 1 for each of N options. At each round t: (1) Select option i with probability proportional to w_i, or deterministically choose the highest-weight option; (2) Observe losses l_1, ..., l_N for all options; (3) Update w_i <- w_i * (1 - eta * l_i) for each option, where eta is the learning rate. The multiplicative update means that options consistently performing poorly see their weights shrink exponentially — after k rounds of high loss, a weight is roughly (1 - eta)^k, which decays rapidly. Options consistently performing well maintain or grow their relative weight.

The regret analysis reveals why the method works. The key potential function is the total weight W_t = sum_i w_i^(t). On one hand, the total weight cannot decrease too fast because the learner randomizes proportionally to weights, linking the weight decrease to the learner's expected loss. On the other hand, the best expert's weight is at most W_T, providing a lower bound. Combining these bounds gives: learner's total loss <= (best expert's total loss) + (ln N)/eta + eta * T, and setting eta = sqrt(ln(N)/T) yields regret O(sqrt(T * ln N)). The logarithmic dependence on N is remarkable — with 1 million experts, the regret only grows by a factor of sqrt(ln(10⁶)) ≈ 3.7 compared to 2 experts.

The universality of multiplicative weights is its most striking feature. In online learning, it is the Hedge algorithm. In machine learning, AdaBoost's training-example weighting is MW applied to a game between the booster and the weak learner. In game theory, MW converges to minimax equilibria in zero-sum games (each player runs MW, and the time-averaged strategies converge to a Nash equilibrium). In combinatorial optimization, it appears in the Plotkin-Shmoys-Tardos framework for approximately solving linear programs. In information theory, it relates to universal coding and the exponential weights forecaster. This convergence of independently discovered techniques to the same multiplicative update is evidence that the method captures something fundamental about decision-making under uncertainty.

Practice Questions 4 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Linear Regression in Machine Learning → Neural Network Fundamentals → Backpropagation Algorithm → Multilayer Perceptrons (MLPs) → Activation Functions in Neural Networks → Vanishing Gradient Problem → Gradient Descent and Optimization → Convex Optimization Fundamentals → Online Learning and Regret Bounds → Multiplicative Weights Method

Longest path: 100 steps · 688 total prerequisite topics

Prerequisites (2)

Online Learning and Regret Boundshard Expected Valuesoft

Leads To (1)

Expert Problem and Hedge Algorithmhard