Questions: Purifying Selection and Deleterious Mutation Removal
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A comparative genomics study finds that the active site of an enzyme evolves 15x more slowly than nearby intergenic DNA across 50 mammalian species. The most parsimonious explanation is:
AThe active site is under strong positive selection, with beneficial mutations being fixed faster than in non-coding regions
BThe active site has an intrinsically lower mutation rate than intergenic DNA due to its GC content
CThe active site is under purifying selection — most mutations there are deleterious and are removed before they can accumulate, while intergenic DNA accumulates mutations freely
DThe active site is evolving neutrally, but constrained by structural requirements that happen to match ancestral sequence
Slow evolution relative to neutral sites (like intergenic DNA) is the signature of purifying selection, not positive selection. Positive selection would accelerate substitution rates, not decelerate them. The logic is: if an enzyme's active site performs a critical function, most amino acid changes will disrupt that function, reducing fitness. Those mutations are eliminated before they can accumulate in the population. By contrast, intergenic DNA can accumulate mutations freely because most changes have no fitness consequence. Option B is tempting but wrong: mutation rates do vary by sequence context, but a 15x difference in *substitution* rate across 50 species reflects selection against fixation, not reduced mutation.
Question 2 Multiple Choice
A mildly deleterious mutation (selection coefficient s = −0.001) arises in a small island population of 40 individuals. Compared to a large mainland population of 200,000 individuals, what is most likely to happen to this mutation?
AThe mutation will be eliminated faster in the small population because natural selection is more efficient with fewer competing alleles
BThe mutation's fate is essentially identical in both populations since the selection coefficient is the same
CIn the small population, genetic drift may overpower weak purifying selection, allowing the mutation to drift to fixation; in the large population, purifying selection efficiently removes it
DThe mutation will reach higher frequency in the large population due to mutation-selection balance dynamics
Whether selection or drift dominates depends on the product Nes (effective population size × selection coefficient). When Nes >> 1, selection is efficient; when Nes << 1, drift dominates. For N=40 and s=0.001: Nes ≈ 0.04 — drift is ~25x stronger than selection, and the mutation is likely to drift to fixation or loss by chance. For N=200,000 and s=0.001: Nes ≈ 200 — selection strongly dominates, and the mutation is efficiently purged. This interaction between drift and purifying selection explains why small, isolated populations accumulate more deleterious mutations over time — a process called Muller's ratchet.
Question 3 True / False
Purifying selection and positive selection can act simultaneously on different sites within the same gene — some positions are under strong constraint against change while others are favored for adaptive divergence.
TTrue
FFalse
Answer: True
This is routinely observed in comparative genomics. A classic example is immune genes like MHC: the antigen-binding groove shows elevated nonsynonymous substitution rates (positive selection for diversity), while the structural scaffold of the protein shows strong conservation (purifying selection against structural disruption). Another example: viral surface proteins often show positive selection at antibody-binding sites and purifying selection at sites critical for receptor binding. The dN/dS ratio, calculated separately for different codons, can detect both signatures simultaneously, which is why this ratio varies substantially across sites within a single gene.
Question 4 True / False
Evolutionary conservation of a DNA sequence is direct evidence that the sequence has been positively selected for beneficial functions.
TTrue
FFalse
Answer: False
Conservation indicates purifying selection AGAINST deleterious changes, not positive selection FOR beneficial ones. A conserved sequence is one where mutations are removed because they reduce fitness — the sequence works, and changes break it. Positive selection, by contrast, drives the accumulation of new beneficial variants, which typically increases divergence rates rather than decreasing them. While it is true that conserved sequences are usually functionally important (which is why mutations there are deleterious), functional importance is the underlying reason for conservation, not a form of positive selection. Confusing conservation with positive selection is a common error in genomics interpretation.
Question 5 Short Answer
Why do third codon positions evolve faster than first and second codon positions, and what does this difference reveal about purifying selection?
Think about your answer, then reveal below.
Model answer: The genetic code is redundant — multiple codons encode the same amino acid. Most synonymous substitutions (those that don't change the amino acid) occur at third positions due to wobble. Because these changes are often phenotypically neutral, purifying selection does not efficiently remove them, and they accumulate at rates approaching the neutral mutation rate. First and second codon positions more frequently produce nonsynonymous changes (amino acid substitutions), which are more likely to disrupt protein structure or function. Purifying selection removes these at higher rates, so they evolve more slowly. The ratio dN/dS < 1 for most genes is direct evidence of this: nonsynonymous changes are eliminated faster than synonymous ones.
This codon position rate difference is one of the strongest pieces of evidence for the pervasiveness of purifying selection at the molecular level. It also validates a key prediction of neutral theory: sequences evolve at rates proportional to how much of their variation is selectively neutral. The practical implication is that synonymous substitution rates serve as a molecular clock approximation while nonsynonymous rates reflect the balance of drift and selection on protein-coding sequences.