The central dogma of molecular biology describes the directional flow of genetic information: DNA is transcribed into RNA, which is then translated into protein. DNA also replicates to pass information to daughter cells. This framework, articulated by Francis Crick in 1958, defines the canonical path by which stored genetic sequence becomes functional molecules. Exceptions such as reverse transcription (RNA to DNA) in retroviruses exist but do not contradict the general principle that protein sequence cannot be reverse-translated back into nucleic acid.
Diagram the three processes — replication, transcription, translation — with arrows indicating direction of information flow. Connect each step to a specific cellular location (nucleus vs. cytoplasm in eukaryotes).
From your study of DNA structure, you know that DNA is a double-stranded polymer encoding information in the sequence of its bases. But DNA itself does nothing — it is an inert archive. The central dogma describes how that stored information gets converted into molecules that actually *do* things: proteins. Francis Crick's 1958 formulation identified three processes: DNA replicates to copy itself, DNA is transcribed into RNA, and RNA is translated into protein. The arrow of information always points away from DNA and toward protein, never in reverse.
Think of DNA as a master blueprint locked in the nucleus (in eukaryotes). You would never let workers handle the original, so instead you make a working copy — messenger RNA — and send that to the construction site. The ribosome, guided by the mRNA sequence, assembles the protein by reading each three-nucleotide codon and adding the corresponding amino acid. The sequence of the mRNA dictates the sequence of the protein, which in turn determines the protein's shape and function. Information flows from one type of molecule to another, but the information content — the sequence — is what is preserved.
The most important conceptual boundary the central dogma draws is at protein. Once information has been translated into an amino acid sequence, it cannot flow back into nucleic acid. This rules out Lamarckian inheritance: a muscle you develop through exercise does not write that information back into your DNA to pass on to children. It also explains why acquired characteristics — scars, skills, environmental adaptations in the body — are not heritable at the genetic level. The body changes, but the genome's sequence does not (barring mutations).
Retroviruses like HIV complicate the simple "DNA → RNA → protein" summary: their genome is RNA, and they carry reverse transcriptase, an enzyme that writes RNA information back into DNA. This DNA then integrates into the host chromosome as a provirus. This is a genuine exception to the transcription arrow, but it does not violate the core constraint — reverse transcriptase makes DNA from RNA, but no enzyme writes DNA from protein. Knowing this exception matters because reverse transcriptase is also a drug target: HIV antiretrovirals inhibit this enzyme precisely because it is absent in human cells.