A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Hypothesis Testing in Regression

College Depth 111 in the knowledge graph ☐ I know this ☆ Set as goal

123topics build on this

577prerequisites beneath it

Confidence Intervals for Means Hypothesis Testing Fundamentals +4 more→→Asymptotic Normality of Regression Estimators Bootstrap Methods for Statistical Inference +9 more

Core Idea

In regression, each coefficient β̂ⱼ has an associated standard error se(β̂ⱼ), and the t-statistic t = (β̂ⱼ − β₀)/se(β̂ⱼ) tests whether βⱼ equals some hypothesized value (usually zero) in the population. Under the null, this t-statistic follows a t-distribution with n−k−1 degrees of freedom; for large samples it approaches the standard normal. Statistical significance at the 5% level means the p-value is below 0.05, but economic significance — whether the effect size matters practically — is a separate judgment. Confidence intervals for coefficients convey both magnitude and precision.

How It's Best Learned

Interpret regression tables from published papers, explaining each coefficient's sign, magnitude, standard error, and significance level. Practice constructing confidence intervals manually from reported standard errors.

Common Misconceptions

A statistically significant coefficient may be economically trivial, especially in large samples.
Failing to reject the null does not prove the null is true — it may reflect low power from a small sample or high variance.

Explainer

When you learned t-tests for comparing means, you computed a test statistic by dividing an estimate by its standard error: t = (x̄ − μ₀)/se(x̄). Hypothesis testing in regression is exactly the same idea applied to a regression coefficient. The OLS estimator β̂ⱼ is a random variable with a sampling distribution — it varies across hypothetical repeated samples. The standard error se(β̂ⱼ) measures how much β̂ⱼ varies across those samples. The t-statistic t = (β̂ⱼ − β₀)/se(β̂ⱼ) measures how many standard errors the estimate lies from the hypothesized value β₀ (usually zero). Under the null, this follows a t-distribution with n − k − 1 degrees of freedom; in large samples it is approximately standard normal.

Reading a regression table fluently means interpreting four things for each coefficient: its sign (direction of effect), its magnitude (how large is the effect), its standard error (how precisely estimated), and its p-value or significance stars (whether you would see this estimate by chance under H₀). The stars tell you whether the effect is statistically distinguishable from zero; they do not tell you whether the effect is large enough to matter. Every published table reports these together, and conflating them is one of the most common errors in applied work.

The statistical vs. economic significance distinction is the central lesson of this topic, and it is non-obvious. With a large enough sample, even a microscopic true effect will produce a tiny standard error, generating a significant p-value. An economist studying wages with n = 2,000,000 records might find that having a window seat at work raises wages by $0.12 per year with p < 0.001. That effect is real — it is not due to noise — but it is economically meaningless. Conversely, with n = 50 observations, a genuinely large effect may fail to reach significance simply because the sample is too small to detect it. Always report effect sizes alongside significance, and ask whether the magnitude of β̂ is large enough to matter for any real decision.

What does it actually mean when a result is significant at the 5% level? It means that if the null hypothesis were true, you would observe a t-statistic at least this large only 5% of the time by chance. It does not mean there is a 95% chance the null is false, and it does not mean the coefficient is 'probably' the right sign. The p-value is a statement about what you would observe in hypothetical repeated sampling under H₀ — not a probability assigned to the null hypothesis being true or false.

Confidence intervals are often more informative than p-values. A 95% confidence interval for β̂ⱼ is approximately β̂ⱼ ± 1.96 × se(β̂ⱼ). This interval gives you the range of effect sizes consistent with the data, rather than forcing a binary significant/not-significant verdict. A very wide confidence interval that just excludes zero is technically significant but tells you almost nothing about the true effect size. A narrow interval centered on a small value tells you the effect is real and small. Reporting and interpreting confidence intervals is the best habit to develop for honest empirical work.

Practice Questions 3 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → P-values and Statistical Significance → Effect Size and Practical Significance → Hypothesis Testing: Framework and Logic → Z-Tests and T-Tests for Means → One-Sample Z-Test for Means → One-Sample and Two-Sample T-Tests → Inference in Linear Regression → Prediction Intervals in Regression → Linear Regression Basics → Residuals and Goodness of Fit (R²) → Simple (Bivariate) OLS Regression → Classical OLS Assumptions (Gauss-Markov) → Multiple Regression → Interpreting Regression Coefficients → Hypothesis Testing in Regression

Longest path: 112 steps · 577 total prerequisite topics

Prerequisites (6)

Interpreting Regression Coefficientshard Hypothesis Testing Fundamentalshard One-Sample and Two-Sample T-Testshard P-values and Statistical Significancehard Confidence Intervals for Meanshard Hypothesis Testing: Framework and Logicsoft

Leads To (11)

Asymptotic Normality of Regression Estimatorshard Bootstrap Methods for Statistical Inferencehard Confidence Intervals and Hypothesis Tests in Regressionhard F-Test and Joint Significancehard Model Specification Testing and Diagnosticshard Randomized Experiments in Development Economicssoft Robust Standard Errorshard Specification Error: RESET Testhard Specification Tests: Ramsey RESET and Hausman Testshard Standard Error Calculation and Correction Methodshard T-Statistic for Individual Coefficientshard