Confidence Intervals (Rigorous Theory)

confidence-intervals interval-estimation statistics

Core Idea

A (1−α) confidence interval [L(X), U(X)] for θ satisfies P_θ(L(X) ≤ θ ≤ U(X)) ≥ 1 − α for every θ in the parameter space. Confidence intervals can be obtained by inverting hypothesis tests: the (1−α) confidence set is {θ₀ : H₀: θ = θ₀ is not rejected at level α}. Inverting a UMP test yields a uniformly most accurate confidence set, the one least likely to cover false parameter values. Asymptotic CIs rely on asymptotic normality of estimators.

Explainer

From asymptotic normality of the MLE, you know that under regularity conditions √n(θ̂_MLE − θ) → N(0, I(θ)⁻¹) in distribution, where I(θ) is the Fisher information of a single observation. This immediately suggests a confidence interval: rearrange the normal approximation to get θ̂ ± z_{α/2}/√(n·I(θ)), with the unknown I(θ) replaced in practice by the estimate I(θ̂). But calling this a *confidence interval* rigorously requires a precise definition, one with a subtlety that most introductory treatments skip.
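The rearrangement can be written out step by step. The following is a sketch, not a proof; the exponential-rate example is an assumption chosen for illustration and does not come from the text above.

```latex
\begin{align*}
1-\alpha
  &\approx P_\theta\!\left(-z_{\alpha/2} \le \sqrt{n\,I(\theta)}\,(\hat\theta-\theta) \le z_{\alpha/2}\right) \\
  &= P_\theta\!\left(\hat\theta - \frac{z_{\alpha/2}}{\sqrt{n\,I(\theta)}}
       \le \theta \le
       \hat\theta + \frac{z_{\alpha/2}}{\sqrt{n\,I(\theta)}}\right).
\end{align*}
% The endpoints still involve the unknown I(\theta), so they are not yet statistics;
% plugging in I(\hat\theta) fixes this.
% Illustrative example: X_1,\dots,X_n i.i.d. Exponential with rate \lambda,
% for which I(\lambda) = 1/\lambda^2 and \hat\lambda = 1/\bar X.
\[
\hat\lambda \pm \frac{z_{\alpha/2}}{\sqrt{n\,I(\hat\lambda)}}
  = \hat\lambda\left(1 \pm \frac{z_{\alpha/2}}{\sqrt{n}}\right).
\]
```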

A confidence interval [L(X), U(X)] is formally a pair of statistics (functions of data, not of the unknown θ) satisfying P_θ(L(X) ≤ θ ≤ U(X)) ≥ 1 − α for *all* θ in the parameter space. Notice the direction of randomness: θ is fixed (though unknown), and the interval is random because L and U depend on the data X. The statement "there is a 95% probability that θ lies in this interval" is literally false once data are observed: θ either is or is not in the realized interval. The correct interpretation is frequentist: if you repeated the experiment and CI construction many times, then in the long run at least (1−α)·100% of the resulting intervals would contain the true θ.
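The repeated-experiment reading of coverage can be checked by simulation. The sketch below is one illustrative setup, not part of the original text: it assumes a normal mean with known standard deviation (where the z-interval is exact), and the names `z_interval` and `empirical_coverage` are hypothetical helpers. θ stays fixed across repetitions; only the interval changes.

```python
import random
from statistics import NormalDist, fmean


def z_interval(sample, sigma, alpha=0.05):
    """(1 - alpha) interval for a normal mean when sigma is known."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * sigma / len(sample) ** 0.5
    center = fmean(sample)
    return center - half_width, center + half_width


def empirical_coverage(theta, sigma=1.0, n=20, reps=20_000, alpha=0.05, seed=0):
    """Fraction of repeated experiments whose interval contains the fixed theta."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(reps):
        # A fresh data set each time: the interval is random, theta is not.
        sample = [rng.gauss(theta, sigma) for _ in range(n)]
        lo, hi = z_interval(sample, sigma, alpha)
        hits += lo <= theta <= hi
    return hits / reps


coverage = empirical_coverage(theta=3.0)
print(f"empirical coverage: {coverage:.3f}")
```

The printed fraction should land close to the nominal 0.95, up to Monte Carlo error. Any single realized interval either contains 3.0 or does not.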

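The test inversion principle connects CIs to hypothesis testing. For each candidate value θ₀, consider a level-α test of H₀: θ = θ₀, and let A(θ₀) be its acceptance region. Define the confidence set C(X) = {θ₀ : X ∈ A(θ₀)}, the set of all parameter values that the data would not reject. The events {θ ∈ C(X)} and {X ∈ A(θ)} are identical, so P_θ(θ ∈ C(X)) = P_θ(X ∈ A(θ)) ≥ 1 − α, with equality when the test has size exactly α; hence C(X) is a valid (1−α) confidence set. This is an exact equivalence, not just an analogy. C(X) need not be an interval in general, although it is for most standard tests. Test optimality translates into CI optimality: inverting a UMP (uniformly most powerful) test gives a uniformly most accurate (UMA) confidence set, meaning that among all (1−α) confidence sets it has the smallest probability of covering any false value of θ. UMP tests usually exist only for one-sided alternatives, so two-sided intervals typically come from UMP unbiased tests instead. Accuracy in this sense is related to, but not the same as, having the shortest length.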
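Test inversion can be carried out numerically by scanning candidate values θ₀ and keeping those the test accepts. The sketch below assumes a binomial model with an equal-tailed exact test, an illustrative choice that is not taken from the text (inverting this particular test is commonly known as the Clopper-Pearson construction). The helper names and the grid resolution are hypothetical.

```python
from math import comb


def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def accepts(x, n, p0, alpha=0.05):
    """Equal-tailed level-alpha test of H0: p = p0. True if x lies in A(p0)."""
    lower_tail = sum(binom_pmf(k, n, p0) for k in range(0, x + 1))  # P(X <= x)
    upper_tail = sum(binom_pmf(k, n, p0) for k in range(x, n + 1))  # P(X >= x)
    return lower_tail > alpha / 2 and upper_tail > alpha / 2


def confidence_set(x, n, alpha=0.05, grid_size=2001):
    """C(x) = {p0 : x in A(p0)}, approximated on an evenly spaced grid."""
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    return [p0 for p0 in grid if accepts(x, n, p0, alpha)]


def coverage(p, n, alpha=0.05):
    """Exact P_p(p in C(X)), computed as P_p(X in A(p))."""
    return sum(binom_pmf(x, n, p) for x in range(n + 1) if accepts(x, n, p, alpha))


accepted = confidence_set(x=7, n=20)
print("interval:", min(accepted), max(accepted))
print("coverage at p = 0.3:", coverage(0.3, 20))
```

Because the binomial is discrete, the test's size is generally below α, so the computed coverage exceeds 1 − α rather than equalling it.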

Asymptotic CIs fill the practical gap. Exact CIs are available in closed form mainly for special families (certain exponential families, location-scale families), where a pivotal quantity or an exact test is at hand. For most problems, the MLE's asymptotic normality provides a general-purpose construction: estimate I(θ) by I(θ̂), apply the asymptotic normal approximation, and invert to get the Wald interval θ̂ ± z_{α/2}/√(n·I(θ̂)). Its coverage converges to 1 − α as n → ∞ but may deviate from 1 − α in finite samples. More refined approaches (likelihood ratio inversion, bootstrap CIs) often improve finite-sample coverage but share the same foundational structure: define coverage probability, connect to a test or pivotal quantity, exploit large-sample approximations.
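The gap between asymptotic and finite-sample coverage can be computed exactly in a simple case. The sketch below assumes a Bernoulli(p) model, an illustrative choice not taken from the text, where I(p) = 1/(p(1 − p)) and the plug-in interval is p̂ ± z_{α/2}·√(p̂(1 − p̂)/n). The function names are hypothetical, and `binom_pmf` assumes 0 < p < 1.

```python
from math import exp, lgamma, log, sqrt
from statistics import NormalDist


def binom_pmf(k, n, p):
    """Binomial pmf in log space so large n does not overflow (needs 0 < p < 1)."""
    log_pmf = (
        lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
        + k * log(p) + (n - k) * log(1 - p)
    )
    return exp(log_pmf)


def wald_interval(x, n, alpha=0.05):
    """p_hat +/- z / sqrt(n * I(p_hat)), with I(p) = 1 / (p * (1 - p))."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    p_hat = x / n
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


def exact_coverage(p, n, alpha=0.05):
    """Exact P_p(L(X) <= p <= U(X)), summing the pmf over covering outcomes."""
    total = 0.0
    for x in range(n + 1):
        lo, hi = wald_interval(x, n, alpha)
        if lo <= p <= hi:
            total += binom_pmf(x, n, p)
    return total


for n in (20, 200, 2000):
    print(n, round(exact_coverage(0.05, n), 3))
```

With p near the boundary and small n, the interval collapses to a single point whenever x = 0, so coverage falls well short of the nominal level; it approaches 0.95 only as n grows. This is the finite-sample deviation that likelihood-ratio and bootstrap intervals aim to reduce.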
