Questions: Linkage Disequilibrium and Evolutionary Dynamics
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A genome-wide association study identifies a SNP marker strongly associated with Type 2 diabetes risk, but functional studies find no biological effect of that SNP itself. The most likely explanation is:
AThe GWAS statistical methods produced a false positive, and the association is not real
BThe marker SNP is in linkage disequilibrium with a nearby causal variant, so the association is a proxy signal
CThe marker SNP has a pleiotropic effect on diabetes through an unknown pathway
DThe association is confounded by population stratification in the study sample
GWAS exploits LD: a marker SNP that is in strong LD with a causal variant will show strong statistical association with the disease even if the marker itself has no biological function. The causal variant and the marker are inherited together on the same haplotype because recombination has not yet separated them. This is not a false positive — the association is real; the marker is just a proxy. Finding the true causal variant requires fine-mapping the LD block with denser SNP coverage or functional experiments.
Question 2 Multiple Choice
A human population experienced a severe genetic bottleneck 200 generations ago, followed by rapid expansion. Compared to an ancient, large population that has never bottlenecked, this population would be expected to have:
ALess linkage disequilibrium, because genetic drift broke up haplotype blocks during the bottleneck
BMore linkage disequilibrium across longer chromosomal stretches, because drift created new associations that have had limited time to decay
CThe same LD structure, because recombination rates are identical in both populations
DLess genetic variation overall but the same LD structure, since LD depends only on recombination rate
A bottleneck creates new LD through genetic drift — random sampling of a small founder population creates chance associations between alleles that may persist for many generations. With 200 generations since the bottleneck, recombination has had limited time to break up these haplotype blocks, especially for closely linked loci. The bottleneck population will have longer LD blocks (reflecting recent history) compared to the ancient population where recombination has been eroding LD over thousands of generations. This is why populations with recent bottlenecks (e.g., isolated island populations) often show extended LD useful for gene mapping.
Question 3 True / False
Linkage disequilibrium can mainly exist between alleles at loci that are physically located on the same chromosome.
TTrue
FFalse
Answer: False
While physical linkage (proximity on the same chromosome) is the most common cause of LD and produces the most persistent LD, genetic drift in small populations can create LD between alleles at unlinked loci — even loci on different chromosomes. In a small founding population, random sampling can cause alleles at independent loci to co-occur more often than expected by chance. This drift-generated LD typically decays rapidly (halving each generation for unlinked loci), but in very small populations or over short timescales it can be substantial. LD is therefore a property of allele frequency co-distributions, not just physical proximity.
Question 4 True / False
Over many generations in a large population, linkage disequilibrium between two loci tends to decrease, with tightly linked loci losing LD more slowly than distantly linked ones.
TTrue
FFalse
Answer: True
LD decays at a rate of (1 − r) per generation, where r is the recombination fraction between the two loci. For tightly linked loci (r ≈ 0), LD decays very slowly — nearly intact from generation to generation. For unlinked loci (r = 0.5), LD halves each generation and approaches zero quickly. This differential decay is what makes LD patterns informative: long haplotype blocks indicate either recent origin (not enough generations for recombination to break them) or tight physical linkage, while short blocks indicate ancient, well-mixed populations or high local recombination rates.
Question 5 Short Answer
How does the length of linkage disequilibrium blocks across a genome serve as a record of population history, and what does a genome with unusually long LD blocks tell you?
Think about your answer, then reveal below.
Model answer: LD blocks decay through recombination over generations. Long LD blocks indicate that allele combinations have not had enough time or recombination events to be broken apart — this signature arises after recent events that created new associations: a selective sweep (a beneficial mutation dragging nearby alleles along via genetic hitchhiking), a population bottleneck (random drift creating new associations in a small founder group), or recent admixture (chromosomal segments from one population entering another with their original haplotype structure intact). Short, fragmented LD blocks indicate an ancient, large, well-recombined population. A genome with unusually long LD blocks therefore suggests recent demographic events or strong recent selection.
The relationship between LD block length and evolutionary history is one of the most powerful tools in population genomics. Because the expected rate of LD decay is mathematically predictable from recombination rates, the observed excess of long blocks can be used to date when events occurred — the longer the blocks, the more recent the event. This logic underlies methods for detecting selection sweeps, identifying admixture events, and reconstructing bottleneck history from modern genetic data.