Gauss-Markov Theorem and OLS Efficiency

Graduate Depth 85 in the knowledge graph I know this Set as goal
efficiency ols theorem

Core Idea

Under the Gauss-Markov assumptions (linearity, zero-mean errors, homoskedasticity, no autocorrelation, no perfect multicollinearity), OLS is the Best Linear Unbiased Estimator (BLUE)—it has the smallest variance among all linear unbiased estimators. This fundamental result justifies OLS as the primary estimation method in applied econometrics.

Explainer

When you learned least-squares regression, you learned how OLS works mechanically — it minimizes the sum of squared residuals to find coefficients. The Gauss-Markov theorem answers a different and deeper question: *why should you use OLS?* Given that infinitely many estimators could produce an estimate of a regression coefficient, the theorem says OLS is the best among a specific class — linear and unbiased — as long as five assumptions hold.

Let's unpack what BLUE actually means. "Linear" means the estimator is a linear function of the observed outcomes Y. "Unbiased" means the expected value of the estimate equals the true parameter: E[β̂] = β. Many estimators are unbiased — you could just pick one observation's Y/X ratio, and on average it might equal β. But unbiased isn't enough; you also want precision. "Best" means lowest variance among all linear unbiased estimators. The Gauss-Markov theorem says OLS achieves this minimum variance — no other estimator in this class is more efficient.

The five Gauss-Markov assumptions are conditions on the data-generating process, not on the sample. Linearity means the true model is linear in parameters (not necessarily in variables — you can include X² and still satisfy linearity). Zero-mean errors means Cov(X, u) = 0 — errors are uncorrelated with regressors, which is the exogeneity condition. Homoskedasticity means all errors have the same variance: Var(uᵢ) = σ² for all i. No autocorrelation means Cov(uᵢ, uⱼ) = 0 for i ≠ j. No perfect multicollinearity means the regressors aren't exact linear combinations of each other. Each assumption plays a specific role in the proof — violate one and a competing estimator can beat OLS.

What happens when assumptions fail is as instructive as when they hold. If errors are heteroskedastic (unequal variance), OLS is still unbiased but is no longer efficient — Generalized Least Squares (GLS), which weights observations by the inverse of their error variance, produces lower-variance estimates. If errors are autocorrelated, the same logic applies. If the zero-mean/exogeneity assumption fails — as in simultaneous equations or omitted variable bias — OLS is not even unbiased, let alone efficient, and instrumental variables become necessary. The Gauss-Markov theorem thus serves as a diagnostic framework: identify which assumption is violated, and the right alternative estimator follows directly from that diagnosis.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionProbability Density Functions and Continuous DistributionsCumulative Distribution FunctionsContinuous Random VariablesNormal DistributionCentral Limit TheoremConfidence Intervals for MeansZ-Tests and T-Tests for MeansOne-Sample Z-Test for MeansOne-Sample and Two-Sample T-TestsHypothesis Testing in RegressionSpecification Error: RESET TestWhite Test and Detection of HeteroskedasticityGeneralized Least Squares (GLS) for Non-Spherical ErrorsFeasible GLS (FGLS) with Estimated Covariance StructureLeast Squares Regression: Fundamentals and DerivationGauss-Markov Theorem and OLS Efficiency

Longest path: 86 steps · 484 total prerequisite topics

Prerequisites (3)

Leads To (0)

No topics depend on this one yet.