What Is the Central Dogma of Molecular Biology?

The central dogma of molecular biology describes how genetic information flows in one direction inside living cells: from DNA to RNA to protein. Francis Crick first proposed this idea in 1958, and it remains the foundational framework for understanding how your genes produce the molecules that build and run your body. But the central dogma is more nuanced than a simple three-step arrow, and the decades since Crick’s proposal have revealed important exceptions worth understanding.

What the Central Dogma Actually States

The central dogma is often simplified as “DNA makes RNA, RNA makes protein,” but Crick’s actual claim was more specific. He was describing the transfer of sequential information, meaning the precise order of building blocks in one molecule dictating the precise order in another. His core assertion: once information has passed into protein, it cannot get out again. Proteins cannot pass their sequence information back to RNA or DNA, and they cannot pass it to other proteins.

Crick identified three molecules involved in information transfer: DNA, RNA, and protein. With three molecules, there are nine possible directions information could flow. He sorted these into three classes. General transfers (DNA to DNA, DNA to RNA, RNA to protein) happen in virtually all cells. Special transfers (RNA to RNA, RNA to DNA, DNA to protein) were plausible but confined to specific situations. The third class, the truly forbidden transfers, were all three routes leading out of protein: protein to protein, protein to RNA, and protein to DNA. Those, Crick argued, do not occur.

Step One: DNA to RNA (Transcription)

The first step in the central dogma is transcription, where a cell reads a stretch of DNA and builds a corresponding RNA copy. An enzyme called RNA polymerase attaches to a specific region of DNA, unwinds the double helix, and moves along one strand, assembling a single-stranded RNA molecule that mirrors the DNA sequence. The process has three phases: initiation (the enzyme lands on the DNA and begins building), elongation (it moves along the strand, adding one RNA building block at a time), and termination (it reaches a signal that tells it to stop and release the finished RNA).

The RNA molecule produced, called messenger RNA, carries the gene’s information out of the cell’s nucleus and into the machinery that builds proteins. But not all RNA becomes a blueprint for protein. Cells also produce many types of non-coding RNA that never get translated. Small RNAs called microRNAs can silence genes by blocking messenger RNA before it reaches the protein-building stage. Long non-coding RNAs regulate gene activity at multiple levels, including controlling which stretches of DNA are accessible and which are locked down. Circular RNAs form closed loops and can act as sponges that absorb microRNAs, indirectly influencing which proteins get made. These molecules add layers of regulation that Crick’s original framework didn’t anticipate.

Step Two: RNA to Protein (Translation)

Translation is where the cell reads the messenger RNA and assembles a protein. This happens at ribosomes, complex molecular machines found in the cell’s cytoplasm. The ribosome clamps onto the messenger RNA and reads it three letters at a time. Each three-letter group, called a codon, specifies one amino acid. There are 64 possible codons mapping to 20 amino acids plus start and stop signals, a system so consistent it is shared, with only minor variations, across all known life on Earth.

The link between RNA and protein depends on transfer RNA, a small molecule that Crick himself predicted would need to exist. Each transfer RNA carries a specific amino acid on one end and reads a specific codon on the other. It physically bridges the messenger RNA and the growing protein chain. The ribosome positions everything correctly, and as each new transfer RNA arrives with its amino acid, the chain grows longer. When the ribosome hits a stop codon, the finished protein is released and folds into its functional shape.

The genetic code itself has a striking internal logic. Codons with certain letters in the middle position tend to code for amino acids with similar physical properties, so a single-letter mutation is more likely to swap in a chemically similar amino acid rather than a drastically different one. This built-in error tolerance means small mistakes in copying or reading DNA are less likely to produce a completely nonfunctional protein.

Reverse Transcription: Information Flowing Backward

In 1970, Howard Temin and David Baltimore discovered an enzyme that seemed to violate the central dogma: reverse transcriptase. This enzyme converts single-stranded RNA back into double-stranded DNA. It was first found in retroviruses, a family of viruses that includes HIV. When a retrovirus enters a cell, it uses reverse transcriptase to copy its RNA genome into DNA, which then gets inserted into the host cell’s own chromosomes.

The discovery was initially seen as a challenge to the central dogma, but it actually fits within Crick’s framework. He had classified RNA-to-DNA transfer as a “special” transfer, meaning it was plausible but not universal. The enzyme first builds a DNA strand from the RNA template, then chews away the original RNA, then builds the second DNA strand using the first as a guide. The result is a complete double-stranded DNA copy of the viral RNA. This process is the reason retroviruses got their name: they transcribe “in reverse.”

RNA Copying Itself

Some viruses skip DNA entirely. RNA viruses like influenza and the coronavirus that causes COVID-19 use an enzyme called RNA-dependent RNA polymerase to copy RNA directly from an RNA template. The enzyme reads the virus’s RNA genome and produces new RNA copies, no DNA intermediate required. This was another of Crick’s “special” transfers: RNA to RNA. It occurs widely in the viral world and also plays a role in gene regulation in plants and fungi, but it is not part of the standard information flow in human cells.

Prions: The Protein-to-Protein Question

The most philosophically interesting challenge to the central dogma comes from prions. Prions are misfolded versions of a normal brain protein. When a misfolded prion contacts a normally folded copy of the same protein, it forces the normal copy to refold into the disease-causing shape. This misfolded form then converts more normal proteins, creating a chain reaction that accumulates in the brain and causes fatal neurodegenerative diseases like Creutzfeldt-Jakob disease in humans and mad cow disease in cattle.

This looks a lot like protein-to-protein information transfer, the very thing Crick said was impossible. The misfolded prion acts as a template, dictating the three-dimensional shape of other protein molecules. Whether this truly violates the central dogma depends on how strictly you define “sequential information.” Prions don’t change the amino acid sequence of the proteins they convert. They change only the shape. Crick was specifically talking about the transfer of sequence information, the order of building blocks. By that narrow definition, prions don’t break the rule. By a broader definition of biological information, they arguably do.

Epigenetics: Controlling the Flow

Epigenetics adds another dimension the original central dogma didn’t address. Chemical tags on DNA and on the proteins that package it can switch genes on or off without altering the underlying DNA sequence. DNA methylation, for example, can silence a gene so it never gets transcribed into RNA, effectively blocking the first step of the central dogma for that gene. These modifications can be copied when cells divide, meaning a cell can pass down patterns of gene activity to its daughter cells even though the DNA sequence is identical.

One dramatic example is X-chromosome inactivation. In cells with two X chromosomes, a long non-coding RNA coats one entire X chromosome and recruits proteins that shut it down. Every gene on that chromosome is silenced for the life of the cell and all cells descended from it. This is a case of RNA reaching back to influence DNA, not by changing its sequence, but by controlling whether its information ever flows forward. It highlights how the central dogma describes the direction of sequence information, but cells have elaborate systems layered on top that decide which sequences actually get read.

Why It Still Matters

Despite its exceptions and complications, the central dogma remains the organizing principle of molecular biology. The core insight holds: the sequence of your DNA determines the sequence of your RNA, which determines the sequence of your proteins, and that flow does not reverse at the protein stage. Every major technology in modern genetics, from genetic testing to mRNA vaccines to gene therapy, is built on this framework. Understanding where the dogma applies cleanly and where biology gets creative around its edges is what separates a textbook understanding from a real one.