A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Preregistration and Research Transparency Planning

College Depth 111 in the knowledge graph ☐ I know this ☆ Set as goal

558prerequisites beneath it

Exploratory and Confirmatory Analysis Strategies and Their Distinct Roles Forming Testable Hypotheses +3 more→

Core Idea

Preregistration involves documenting research hypotheses, design decisions, and analytical plans before data collection, creating a public record that distinguishes confirmatory hypothesis testing from exploratory analysis. Preregistration reduces researcher degrees of freedom—the flexibility in decision-making that can inflate false positive rates and effect size estimates through p-hacking and HARKing (Hypothesizing After Results are Known). Open science practices including preregistration, open data, and open code enhance transparency and reproducibility. Preregistration is particularly valuable in exploratory research and when researchers have many possible analytical choices.

How It's Best Learned

Write a detailed preregistration document for a hypothetical study, specifying all design, measurement, and analytical decisions before data collection.

Common Misconceptions

Preregistration is only for confirmatory studies (actually, it is valuable for both exploratory and confirmatory research). Preregistration prevents all flexibility in analysis (actually, sensitivity analyses and robustness checks can still occur; preregistration just distinguishes them from primary analyses).

Explainer

From hypothesis formation, you know how to construct a testable, grounded, directional hypothesis. From open science and research ethics, you know that psychology has faced a replication crisis — many published findings fail when independent labs attempt to reproduce them. Preregistration addresses a root cause of that crisis: not deliberate fraud, but the quiet inflation of false positives that happens when researchers have too many undisclosed choices during analysis.

The key concept is researcher degrees of freedom: the range of legitimate-seeming analytical decisions available at each step — which participants to exclude as outliers, whether to log-transform a skewed variable, which covariates to control for, which of several collected dependent variables to report, whether to run one more participant after a near-significant result. No single choice is obviously wrong. The problem is what happens when a researcher (consciously or not) cycles through combinations until something reaches p < .05 and then reports only that analysis as if it were the only one tried. The nominal α = .05 threshold no longer means what it claims. Each additional analytical choice is a fork in the road; if you walk enough forks and report only the significant path, you will find significance even in noise. Simulations show that with just a handful of unconstrained analytical choices, the true false positive rate can exceed 60% while appearing to be 5%.

Preregistration is the prophylactic: by documenting your hypothesis, design, and analysis plan in a public registry *before* data collection, you bind yourself. The timestamp proves the hypothesis existed before the data. A preregistered analysis is confirmatory: the test was specified in advance, so the p-value is interpretable at face value — a false positive rate of 5% really means 5%. Any analysis not in the preregistration is exploratory: interesting, potentially hypothesis-generating, but not confirmatory. The critical move is not eliminating flexibility but making the distinction *transparent to readers*. You can still run exploratory analyses; you just label them honestly.

HARKing — Hypothesizing After Results are Known — is the specific abuse that preregistration prevents most directly. A researcher runs an exploratory analysis, finds an unexpected significant effect, then writes the paper as if that was the hypothesis all along. The finding looks like a confirmatory test but is really an exploratory one. The study's false positive rate is not the nominal α but something much higher, because the hypothesis was selected precisely because it was significant. Preregistration timestamps the hypothesis before the data exist, making HARKing impossible — or at minimum visible as a deviation from the registered plan. Preregistration doesn't change what you find; it changes what your findings *mean*. A p = .03 in a preregistered study is strong evidence; a p = .03 that emerged from twenty undisclosed analysis variants is considerably weaker, regardless of what the paper claims.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Conditional Distributions → Bivariate Normal Distribution → Normal Distribution → Standard Normal Distribution and Z-Scores → Hypothesis Testing Fundamentals → Experimental Research Design → Control and Experimental Groups → Random Assignment → Confounding Variables and Internal Validity → Blinding and Demand Characteristics → Validity in Psychological Measurement → Inferential Statistics in Psychology → Effect Size and Statistical Power → Effect Size Reporting and Practical Interpretation → Type I and Type II Error Trade-offs in Decision Making → Multiple Comparisons Problem and Correction Methods → Multiple Comparisons and Type I Error Rate Control → Exploratory and Confirmatory Analysis Strategies and Their Distinct Roles → Preregistration and Research Transparency Planning

Longest path: 112 steps · 558 total prerequisite topics

Prerequisites (5)

Forming Testable Hypotheseshard Exploratory and Confirmatory Analysis Strategies and Their Distinct Roleshard Replication and the Open Science Movementsoft Ethics in Psychological Researchsoft Analysis Planning and Preregistration of Hypothesessoft

Leads To (0)

No topics depend on this one yet.