A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Multiple Regression

College Depth 109 in the knowledge graph ☐ I know this ☆ Set as goal

144topics build on this

574prerequisites beneath it

Classical OLS Assumptions (Gauss-Markov)Expected Value: Theory and Properties +6 more→→Arbitrage Pricing Theory (APT) and Factor Models Cross-Validation and Out-of-Sample Model Evaluation +17 more

Core Idea

Multiple regression extends OLS to include several explanatory variables: y = β₀ + β₁x₁ + β₂x₂ + … + βₖxₖ + u. Each coefficient βⱼ represents the partial effect of xⱼ on y holding all other regressors constant — this 'ceteris paribus' interpretation is the central analytical payoff. In matrix form, the estimator is β̂ = (X'X)⁻¹X'y, which requires (X'X) to be invertible (no perfect multicollinearity). Adding control variables changes coefficient estimates if and only if those controls are correlated with both the dependent variable and the included regressors.

How It's Best Learned

Compare simple and multiple regression estimates on the same dataset — seeing how the wage coefficient on education changes when experience is added illustrates what 'holding constant' means in practice.

Common Misconceptions

More control variables do not always improve estimation — including irrelevant variables reduces efficiency and including endogenous controls can introduce new bias.
The coefficient on x₁ does not represent the effect of x₁ alone; it is always conditional on the other included variables.

Explainer

You already know bivariate regression: a single explanatory variable x₁ predicts y via ŷ = β̂₀ + β̂₁x₁, with OLS minimizing the sum of squared residuals. Multiple regression extends this to k explanatory variables — y = β₀ + β₁x₁ + β₂x₂ + … + βₖxₖ + u — and the conceptual payoff is enormous. Including additional regressors allows each coefficient to represent a partial effect: β₁ is the estimated change in y for a one-unit increase in x₁ *holding all other regressors constant*. This "ceteris paribus" interpretation is what lets economists isolate the effect of one variable from the confounding influence of others.

The wage-education example makes the logic concrete. A bivariate regression of wages on education gives a coefficient that captures not just education's direct effect but also any correlation between education and other determinants of wages (like experience or family background). When you add experience to the model, the education coefficient changes — and that change is informative. It tells you that part of the original estimate was actually attributable to the correlation between education and experience. The new coefficient is the effect of education among workers with the same years of experience.

In matrix notation, the OLS estimator is β̂ = (X'X)⁻¹X'y, where X is the n × (k+1) matrix of regressors including the constant column, and y is the n × 1 outcome vector. This formula generalizes the bivariate formula and makes the required conditions explicit: (X'X) must be invertible, which fails under perfect multicollinearity. You have seen matrix inverses in your prerequisites; here the condition det(X'X) ≠ 0 is the non-redundancy requirement — no regressor can be an exact linear combination of the others.

The "more controls is always better" intuition is wrong and important to resist. Adding a variable changes coefficient estimates only if it is correlated with both the outcome and the included regressors. Adding a truly irrelevant variable leaves coefficients unchanged in expectation but inflates their standard errors, reducing your ability to detect real effects. Adding an endogenous variable — one caused by your regressor — can introduce bias that wasn't there before, a phenomenon you'll study deeply when you reach omitted variable bias and simultaneity.

Multiple regression is the workhorse of empirical economics. From here, you'll study how to test whether a group of coefficients is jointly significant (F-tests), what happens when you omit a relevant variable, and how to handle categorical variables with dummies. Every one of those topics is an extension of the partial-effect logic you are building here.

Practice Questions 3 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → P-values and Statistical Significance → Effect Size and Practical Significance → Hypothesis Testing: Framework and Logic → Z-Tests and T-Tests for Means → One-Sample Z-Test for Means → One-Sample and Two-Sample T-Tests → Inference in Linear Regression → Prediction Intervals in Regression → Linear Regression Basics → Residuals and Goodness of Fit (R²) → Simple (Bivariate) OLS Regression → Classical OLS Assumptions (Gauss-Markov) → Multiple Regression

Longest path: 110 steps · 574 total prerequisite topics

Prerequisites (8)

Simple (Bivariate) OLS Regressionhard Classical OLS Assumptions (Gauss-Markov)hard Linear Transformationshard Expected Value: Theory and Propertieshard Matrices Introductionsoft Matrix Operationssoft Invertible Matrices and Matrix Inversessoft Linear Regression and Least Squares Estimationsoft

Leads To (19)

Arbitrage Pricing Theory (APT) and Factor Modelssoft Cross-Validation and Out-of-Sample Model Evaluationhard Dummy Variables and Categorical Regressorshard Information Criteria: AIC and BIC for Model Selectionhard Interaction Terms in Regressionhard Interpreting Regression Coefficientshard Lagged Dependent Variable Regressionhard Logit and Probit Models for Binary Outcomeshard Model Specification Testing and Diagnosticshard Multicollinearityhard Omitted Variable Biashard Panel Data: Structure and Advantageshard Polynomial Regression and Nonlinear Functional Formshard Prediction Intervals and Out-of-Sample Forecastingsoft Quantile Regression and Distributional Effectshard Ridge, Lasso, and Elastic Net Regressionhard Specification Error: RESET Testhard Two-Stage Least Squares (2SLS)hard Variance Inflation Factor and Multicollinearity Diagnosissoft