A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Autoregressive (AR) Models and Order Selection

College · Depth 115 in the knowledge graph

4 topics build on this

865 prerequisites beneath it


Core Idea

An AR(p) model regresses a series on its own p lags: yₜ = c + φ₁yₜ₋₁ + ... + φₚyₜ₋ₚ + εₜ. Order p is chosen using ACF/PACF plots or information criteria; AR models capture temporal dependence and form the basis for ARMA and ARIMA extensions.

Explainer

You already know from stationarity that a well-behaved time series has statistical properties that don't drift over time. An AR model builds on this property: in a stationary series, the relationship between a value and its own lags is stable, so dependence estimated from past data can be expected to hold going forward. An autoregressive model of order p, written AR(p), formalizes this by regressing yₜ (today's value) on its own p most recent lags: yₜ = c + φ₁yₜ₋₁ + φ₂yₜ₋₂ + ... + φₚyₜ₋ₚ + εₜ, where c is a constant and εₜ is a white-noise shock. Think of GDP growth, which tends to persist: a strong quarter is more likely to be followed by another strong quarter than by a contraction. The φ coefficients capture exactly that persistence.
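The regression view of an AR model can be sketched in a few lines. The example below is a minimal illustration, not a production estimator: it simulates an AR(1) with assumed values c = 0.5 and φ₁ = 0.7 and standard normal shocks, then recovers the coefficients by ordinary least squares of yₜ on yₜ₋₁. The function names `simulate_ar1` and `fit_ar1_ols` are hypothetical helpers introduced here.

```python
import random


def simulate_ar1(c, phi, n, seed=0, burn_in=200):
    """Simulate y_t = c + phi * y_{t-1} + eps_t with eps_t ~ N(0, 1)."""
    rng = random.Random(seed)
    y = c / (1.0 - phi)  # start at the unconditional mean (requires |phi| < 1)
    series = []
    for t in range(n + burn_in):
        y = c + phi * y + rng.gauss(0.0, 1.0)
        if t >= burn_in:  # discard the burn-in so the start value has little effect
            series.append(y)
    return series


def fit_ar1_ols(series):
    """OLS regression of y_t on a constant and y_{t-1}; returns (c_hat, phi_hat)."""
    x = series[:-1]  # lagged values y_{t-1}
    y = series[1:]   # current values y_t
    n = len(y)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    phi_hat = sxy / sxx
    c_hat = mean_y - phi_hat * mean_x
    return c_hat, phi_hat


# Assumed illustration values: c = 0.5, phi = 0.7, 2000 observations.
series = simulate_ar1(c=0.5, phi=0.7, n=2000)
c_hat, phi_hat = fit_ar1_ols(series)
print(f"c_hat = {c_hat:.3f}, phi_hat = {phi_hat:.3f}")
```

With a sample this long, the estimate of φ₁ should land close to the value used in the simulation; on short samples the OLS estimate is noticeably noisier.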

The trickiest part of AR modeling is choosing p, the number of lags to include. Two diagnostic tools guide this. The autocorrelation function (ACF) measures the correlation between yₜ and yₜ₋ₖ for various lags k; it shows total correlation, including indirect effects that pass through the intermediate lags. The partial autocorrelation function (PACF) strips out those indirect effects and shows the unique contribution of lag k after controlling for lags 1 through k−1. For an AR(p) process, the PACF cuts off sharply after lag p while the ACF decays gradually. This contrast is your diagnostic: if the sample PACF is insignificant beyond lag 2 (inside the approximate ±1.96/√n band, where n is the sample size) and the ACF declines slowly, you're likely looking at an AR(2).
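The ACF/PACF contrast can be seen numerically. The sketch below simulates an AR(2) with assumed coefficients φ₁ = 0.5 and φ₂ = 0.3, computes the sample ACF, and derives the PACF with the Durbin-Levinson recursion. The helper names are hypothetical, and the ±1.96/√n significance band is the usual large-sample approximation rather than an exact test.

```python
import math
import random


def simulate_ar2(phi1, phi2, n, seed=1, burn_in=200):
    """Simulate y_t = phi1*y_{t-1} + phi2*y_{t-2} + eps_t with eps_t ~ N(0, 1)."""
    rng = random.Random(seed)
    prev1 = prev2 = 0.0
    series = []
    for t in range(n + burn_in):
        y = phi1 * prev1 + phi2 * prev2 + rng.gauss(0.0, 1.0)
        prev2, prev1 = prev1, y
        if t >= burn_in:
            series.append(y)
    return series


def sample_acf(series, max_lag):
    """Sample autocorrelations rho_0 .. rho_max_lag."""
    n = len(series)
    mean = sum(series) / n
    dev = [v - mean for v in series]
    gamma0 = sum(d * d for d in dev) / n
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / n / gamma0
        for k in range(max_lag + 1)
    ]


def sample_pacf(acf):
    """Durbin-Levinson recursion: PACF at lags 1..K from rho_0..rho_K."""
    pacf = []
    phi_prev = []  # coefficients of the AR(k-1) fit from the previous step
    for k in range(1, len(acf)):
        num = acf[k] - sum(phi_prev[j] * acf[k - 1 - j] for j in range(k - 1))
        den = 1.0 - sum(phi_prev[j] * acf[j + 1] for j in range(k - 1))
        phi_kk = num / den  # partial autocorrelation at lag k
        phi_prev = [
            phi_prev[j] - phi_kk * phi_prev[k - 2 - j] for j in range(k - 1)
        ] + [phi_kk]
        pacf.append(phi_kk)
    return pacf


n = 3000
series = simulate_ar2(0.5, 0.3, n)
acf = sample_acf(series, max_lag=8)
pacf = sample_pacf(acf)
band = 1.96 / math.sqrt(n)  # approximate 95% band for a zero partial autocorrelation
for lag in range(1, 9):
    flag = "*" if abs(pacf[lag - 1]) > band else " "
    print(f"lag {lag}: ACF = {acf[lag]:+.3f}  PACF = {pacf[lag - 1]:+.3f} {flag}")
```

In the printout, the ACF column should shrink gradually across lags, while the PACF column should be clearly nonzero at lags 1 and 2 and small afterwards. An occasional starred lag beyond 2 is expected by chance, since each lag is checked against a 95% band.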

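For a more formal approach, information criteria like AIC (Akaike) and BIC (Bayesian) balance fit against parsimony. With k estimated parameters, n observations, and maximized likelihood L, AIC = −2 ln L + 2k and BIC = −2 ln L + k ln n, so BIC penalizes each extra lag more heavily than AIC whenever n ≥ 8 (ln n > 2) and tends to select smaller models. You fit models of various orders on the same estimation sample and choose the p that minimizes the criterion. The practical advice: start with the PACF plot to get a ballpark, then confirm with AIC/BIC, and prefer lower-order models unless higher-order ones show substantial improvement.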
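Order selection by information criteria can be sketched as a loop over candidate orders. The example below is an approximation under stated assumptions: it uses Yule-Walker innovation variances from the Levinson-Durbin recursion in place of a full maximum-likelihood fit, so each criterion takes the form n ln σ̂²ₚ plus its penalty, with additive constants dropped. The simulated AR(2) coefficients (0.5, 0.3), the maximum lag of 8, and all function names are illustrative choices.

```python
import math
import random


def simulate_ar2(phi1, phi2, n, seed=2, burn_in=200):
    """Simulate y_t = phi1*y_{t-1} + phi2*y_{t-2} + eps_t with eps_t ~ N(0, 1)."""
    rng = random.Random(seed)
    prev1 = prev2 = 0.0
    series = []
    for t in range(n + burn_in):
        y = phi1 * prev1 + phi2 * prev2 + rng.gauss(0.0, 1.0)
        prev2, prev1 = prev1, y
        if t >= burn_in:
            series.append(y)
    return series


def autocovariances(series, max_lag):
    """Sample autocovariances gamma_0 .. gamma_max_lag (divisor n)."""
    n = len(series)
    mean = sum(series) / n
    dev = [v - mean for v in series]
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / n
        for k in range(max_lag + 1)
    ]


def innovation_variances(gamma):
    """Levinson-Durbin: Yule-Walker shock variance for AR(0), AR(1), ..., AR(K)."""
    variances = [gamma[0]]
    phi_prev = []
    for k in range(1, len(gamma)):
        num = gamma[k] - sum(phi_prev[j] * gamma[k - 1 - j] for j in range(k - 1))
        phi_kk = num / variances[-1]
        phi_prev = [
            phi_prev[j] - phi_kk * phi_prev[k - 2 - j] for j in range(k - 1)
        ] + [phi_kk]
        variances.append(variances[-1] * (1.0 - phi_kk ** 2))
    return variances


def select_order(series, max_lag):
    """Return (table, best_aic, best_bic); table rows are (p, aic, bic)."""
    n = len(series)
    variances = innovation_variances(autocovariances(series, max_lag))
    table = []
    for p, sigma2 in enumerate(variances):
        k = p + 1  # p lag coefficients plus the constant
        aic = n * math.log(sigma2) + 2 * k
        bic = n * math.log(sigma2) + k * math.log(n)
        table.append((p, aic, bic))
    best_aic = min(table, key=lambda row: row[1])[0]
    best_bic = min(table, key=lambda row: row[2])[0]
    return table, best_aic, best_bic


series = simulate_ar2(0.5, 0.3, n=3000)
table, best_aic, best_bic = select_order(series, max_lag=8)
for p, aic, bic in table:
    print(f"p = {p}: AIC = {aic:9.2f}  BIC = {bic:9.2f}")
print(f"AIC picks p = {best_aic}, BIC picks p = {best_bic}")
```

Because the BIC penalty per parameter (ln n) exceeds the AIC penalty (2) at this sample size, the order chosen by BIC is never larger than the one chosen by AIC. Using the same autocovariances for every candidate order also keeps the comparison on a common sample.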

The stability of an AR model depends critically on the φ coefficients. You learned about unit roots when studying stationarity: if φ₁ = 1 in an AR(1), the series has a unit root and is nonstationary; past shocks never die out, and standard inference breaks down. For a stationary AR(p), all roots of the characteristic polynomial 1 − φ₁z − φ₂z² − ... − φₚzᵖ must lie outside the unit circle. In practice, this restricts the φ coefficients: an AR(1) needs |φ₁| < 1, and an AR(2) needs φ₁ + φ₂ < 1, φ₂ − φ₁ < 1, and |φ₂| < 1. This is why stationarity testing comes before AR modeling: when an AR model is fitted to a nonstationary series, the usual t-statistics and forecast intervals are no longer reliable. Once you confirm stationarity (or difference to achieve it), AR models become powerful forecasting workhorses, forming the AR component of the more general ARIMA framework you'll encounter next.
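The root condition and the persistence of shocks can both be checked directly. The sketch below is limited to AR(1) and AR(2) so that the characteristic roots follow from the quadratic formula; it also traces the effect of a one-unit shock over time for any list of φ coefficients. The function names and the numerical tolerance are assumptions made for this illustration.

```python
import cmath


def ar2_characteristic_roots(phi1, phi2):
    """Roots of 1 - phi1*z - phi2*z**2 = 0 (reduces to AR(1) when phi2 == 0)."""
    if phi2 == 0:
        if phi1 == 0:
            return []  # white noise: no roots to check
        return [complex(1.0 / phi1)]
    # Equivalent quadratic: phi2*z**2 + phi1*z - 1 = 0
    disc = cmath.sqrt(phi1 ** 2 + 4.0 * phi2)
    return [(-phi1 + disc) / (2.0 * phi2), (-phi1 - disc) / (2.0 * phi2)]


def is_stationary_ar2(phi1, phi2, tol=1e-8):
    """True when every characteristic root lies strictly outside the unit circle."""
    return all(abs(root) > 1.0 + tol for root in ar2_characteristic_roots(phi1, phi2))


def impulse_response(phis, horizon):
    """Effect of a one-unit shock at time 0 on y_0, y_1, ..., y_horizon."""
    psi = [1.0]
    for h in range(1, horizon + 1):
        psi.append(
            sum(phi * psi[h - j] for j, phi in enumerate(phis, start=1) if h - j >= 0)
        )
    return psi


for phi1, phi2 in [(0.5, 0.3), (0.9, 0.3), (1.0, 0.0)]:
    moduli = [round(abs(r), 3) for r in ar2_characteristic_roots(phi1, phi2)]
    print(f"phi = ({phi1}, {phi2}): |roots| = {moduli}, "
          f"stationary = {is_stationary_ar2(phi1, phi2)}")

print("shock after 10 periods, phi1 = 0.5:", impulse_response([0.5], 10)[-1])
print("shock after 10 periods, phi1 = 1.0:", impulse_response([1.0], 10)[-1])
```

With φ₁ = 0.5 the shock has shrunk to 0.5¹⁰ after ten periods, while with φ₁ = 1 it remains at full size, which is the sense in which shocks to a unit-root series never die out.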


Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → 
Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → P-values and Statistical Significance → Effect Size and Practical Significance → Hypothesis Testing: Framework and Logic → Z-Tests and T-Tests for Means → One-Sample Z-Test for Means → One-Sample and Two-Sample T-Tests → Inference in Linear Regression → Prediction Intervals in Regression → Linear Regression Basics → Residuals and Goodness of Fit (R²) → Simple (Bivariate) OLS Regression → Classical OLS Assumptions (Gauss-Markov) → Multiple Regression → Interpreting Regression Coefficients → Hypothesis Testing in Regression → F-Test and Joint Significance → Chow Test and Detection of Structural Breaks → Unit Roots and Testing for Stationarity → Autoregressive (AR) Models and Order Selection

Longest path: 116 steps · 865 total prerequisite topics

Prerequisites (1)

Unit Roots and Testing for Stationarity (hard)

Leads To (3)

ARIMA Models and Time Series Forecasting (hard)
Modeling Time-Varying Volatility with GARCH (soft)
Vector Autoregression (VAR) Models and Impulse Responses (hard)