Explain why the genome-wide significance threshold for GWAS is typically set at p < 5e-8 rather than the conventional p < 0.05.
Think about your answer, then reveal below.
Model answer: A GWAS typically tests approximately 1 million independent SNPs (after accounting for linkage disequilibrium) for association with a phenotype. At p < 0.05, this would produce ~50,000 false positives. The threshold of 5e-8 approximates a Bonferroni correction for 1 million tests (0.05 / 1,000,000 = 5e-8), controlling the genome-wide false positive rate at 5%. This stringent threshold means that only very strong statistical signals survive, requiring large sample sizes (often tens to hundreds of thousands of individuals) to detect the typically small effects of individual variants on complex traits.
The 5e-8 threshold has become a community standard for human GWAS. It is conservative (Bonferroni assumes independence, but SNPs in LD are correlated), but its stringency has proven valuable — GWAS results that pass this threshold replicate in independent cohorts at very high rates, demonstrating that the threshold effectively controls false discoveries.