← Graph View All Domains

A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Classical OLS Assumptions (Gauss-Markov)

College Depth 108 in the knowledge graph ☐ I know this ☆ Set as goal

159topics build on this

570prerequisites beneath it

See this on the map →

Expected Value Random Variables +3 more→→Autocorrelation: Structure and Sources Endogeneity +14 more

Core Idea

The Gauss-Markov theorem states that OLS is the Best Linear Unbiased Estimator (BLUE) when six classical assumptions hold: linearity in parameters, random sampling, no perfect multicollinearity, zero conditional mean of errors (E[u|x]=0), homoskedasticity, and no serial correlation. The most critical assumption is E[u|x]=0, which requires that all determinants of y omitted from the model are uncorrelated with x. When this assumption fails — due to omitted variables, measurement error, or simultaneity — OLS estimates are biased and inconsistent. The remaining assumptions govern efficiency rather than unbiasedness.

How It's Best Learned

Work through examples of each assumption violation — simulate data with heteroskedastic errors, then see how OLS still estimates coefficients correctly (unbiased) but standard errors are wrong. This separates biasedness from inefficiency.

Common Misconceptions

Violating homoskedasticity biases standard errors, not coefficients — a common confusion.
The 'linearity' assumption applies to parameters (β), not to the functional form of x; including x² is still 'linear in parameters'.

Explainer

When you learned bivariate regression, you found a formula that fits a line through data. The Gauss-Markov theorem tells you when that line can be trusted as more than a description of the sample — specifically, when OLS is the Best Linear Unbiased Estimator (BLUE) for the population parameters. Understanding the theorem means understanding which assumptions are doing what.

The six classical assumptions can be grouped by what they protect. The first three — linearity in parameters, random sampling, and no perfect multicollinearity — are structural requirements that make estimation possible at all. If the model is nonlinear in parameters, or if two regressors are perfectly collinear, OLS simply cannot produce a unique solution. These assumptions are often satisfied by construction.

The fourth assumption, E[u|x] = 0, is the most critical and the most likely to fail. It says that the expected value of the error term, conditional on x, is zero — in other words, knowing x tells you nothing about the average size of the unobserved factors in u. This is the exogeneity condition. It fails whenever an omitted variable is correlated with x (omitted variable bias), when x is measured with error (attenuation bias), or when x and y jointly determine each other (simultaneity). When E[u|x] ≠ 0, the coefficient estimates are biased and inconsistent — no amount of additional data will fix the problem.

The fifth and sixth assumptions — homoskedasticity (constant error variance) and no serial correlation — govern efficiency, not unbiasedness. When these fail, OLS remains unbiased and consistent, but it is no longer the minimum-variance estimator among linear unbiased estimators. In practice, heteroskedasticity is extremely common (error variance often grows with income, firm size, or other scale variables), and the fix is straightforward: use heteroskedasticity-robust standard errors. The coefficients themselves are kept; only the standard errors are corrected.

A common confusion arises from the word "linearity" in the first assumption. The linearity requirement applies to the parameters β — the model must be linear in β — not to the functional form of the regressors. A model with x, x², and log(x) on the right-hand side is perfectly linear in parameters and satisfies the assumption. This flexibility means OLS can handle a wide range of nonlinear relationships between y and x, as long as the model remains linear in the unknowns you are estimating.

Practice Questions 3 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → P-values and Statistical Significance → Effect Size and Practical Significance → Hypothesis Testing: Framework and Logic → Z-Tests and T-Tests for Means → One-Sample Z-Test for Means → One-Sample and Two-Sample T-Tests → Inference in Linear Regression → Prediction Intervals in Regression → Linear Regression Basics → Residuals and Goodness of Fit (R²) → Simple (Bivariate) OLS Regression → Classical OLS Assumptions (Gauss-Markov)

Longest path: 109 steps · 570 total prerequisite topics

Prerequisites (5)

Simple (Bivariate) OLS Regressionhard Expected Valuehard Random Variableshard Variance and Standard Deviation of Random Variablessoft Normal Distributionsoft

Leads To (16)

Autocorrelation: Structure and Sourceshard Endogeneityhard Estimator Properties: Consistency, Unbiasedness, and Efficiencyhard Gauss-Markov Theorem and OLS Efficiencyhard Generalized Least Squares (GLS) for Non-Spherical Errorshard Heteroskedasticityhard Heteroskedasticity: Types and Causeshard Least Squares Regression: Fundamentals and Derivationhard Maximum Likelihood Estimationhard Measurement Error and Its Consequenceshard Missing Data: Mechanisms and Analytical Solutionssoft Multiple Regressionhard Omitted Variable Biashard Returns to Education (Mincer Equation)soft Serial Correlation (Autocorrelation) in Regressionhard Standard Error Calculation and Correction Methodshard