Despite the genetic code's degeneracy, most organisms use certain synonymous codons more frequently than others. Codon bias reflects weak selection for translation efficiency, optimal tRNA availability, and mRNA stability. This bias can be detected in comparative genomics and affects evolutionary rates of synonymous sites.
You know from the genetic code that most amino acids are encoded by multiple codons — leucine, for example, has six. If these synonymous codons were truly interchangeable, you would expect them to appear at roughly equal frequencies. But they don't: in almost every organism examined, from *E. coli* to humans, certain codons are used far more often than their synonyms. This non-random usage is codon bias, and understanding it reveals that "synonymous" does not mean "invisible to selection."
The primary driver of codon bias is translational selection — weak but persistent natural selection favoring codons that are translated more quickly and accurately. Each codon is recognized by a specific transfer RNA (tRNA), and the cell does not produce all tRNAs in equal amounts. Preferred codons are those recognized by the most abundant tRNAs, so ribosomes spend less time waiting for the correct charged tRNA to arrive. In highly expressed genes — ribosomal proteins, metabolic enzymes, anything the cell needs in large quantities — this speed advantage matters: faster translation means the cell can produce more protein per unit time, and fewer translational errors mean fewer wasted or misfolded proteins. The fitness benefit per codon is tiny, but summed across thousands of codons in hundreds of highly expressed genes, the cumulative effect is selectable.
The strength of codon bias correlates with two factors: gene expression level and effective population size. Highly expressed genes show the strongest bias because selection for translational efficiency is strongest when the gene is translated thousands of times per cell cycle. Effective population size matters because codon bias is driven by *weak* selection — the selection coefficient per synonymous site is on the order of 10⁻⁶ to 10⁻⁸. From your understanding of the selection coefficient, you know that selection is only effective when its magnitude exceeds the reciprocal of the effective population size (s > 1/Nₑ). This is why codon bias is most pronounced in organisms with large effective population sizes like bacteria and *Drosophila*, and weaker in mammals with smaller effective population sizes where genetic drift overwhelms the feeble selective advantage of preferred codons.
Codon bias has practical implications for both evolutionary analysis and biotechnology. In evolutionary genomics, it challenges the assumption that synonymous substitutions are strictly neutral — if preferred codons are under selection, then synonymous substitution rates (dS) are not a pure molecular clock but are influenced by selection intensity, which varies among genes and lineages. In biotechnology, codon optimization — rewriting a gene to use the host organism's preferred codons — is standard practice when expressing foreign proteins in bacteria or yeast, precisely because matching the host's tRNA pool dramatically increases protein yield. The existence of codon bias is a reminder that even at the finest scale of molecular evolution, selection can shape patterns that initially appear random.