What Happens in Transcription and Translation?

Transcription converts your DNA into an RNA copy, and translation uses that RNA copy to build a protein. Together, these two processes form the core of gene expression: how the instructions stored in your genes become the molecules that do actual work in your body. They happen in sequence, with transcription coming first, and each involves its own molecular machinery, location, and set of steps.

Where Each Process Takes Place

In human cells and other complex (eukaryotic) organisms, transcription and translation happen in separate compartments. Transcription takes place inside the nucleus, where your DNA is stored. The RNA message then travels out of the nucleus and into the cytoplasm, where translation occurs on ribosomes. Some ribosomes float freely in the cytoplasm, while others are attached to internal membranes, where they make proteins destined for export out of the cell or for use within those membranes.

Bacteria handle things differently. They have no nucleus, so transcription and translation happen in the same space and can even occur simultaneously: ribosomes can start translating an RNA message before it’s fully finished being transcribed.

Transcription: Copying DNA Into RNA

Transcription is the process of reading one strand of DNA and building a complementary strand of RNA. The enzyme responsible is RNA polymerase, which binds to a specific region of DNA near a gene, unwinds the double helix, and assembles an RNA strand one building block at a time. The process follows three stages: initiation, elongation, and termination.

During initiation, RNA polymerase recognizes and attaches to a promoter, a stretch of DNA that signals where a gene begins. In human cells, a collection of helper proteins (called transcription factors) assist RNA polymerase in finding and binding to the right spot. Once locked in place, the enzyme pries open the DNA double helix to expose the template strand.

Elongation is the main phase of RNA construction. RNA polymerase moves along the DNA, reading one base at a time and adding the matching RNA base to a growing strand. Each cycle involves the incoming building block binding to the enzyme, a chemical bond forming to attach it to the chain, a byproduct being released, and the enzyme sliding forward one position to repeat the process. In human cells, RNA polymerase II travels at roughly 18 to 42 bases per second along DNA, though speeds up to 100 bases per second have been reported.

Termination occurs when RNA polymerase encounters a signal in the DNA that tells it to stop. The enzyme releases the newly made RNA strand and detaches from the DNA, which rewinds behind it.

RNA Processing Before Translation

In human cells, the initial RNA transcript (called pre-mRNA) isn’t ready for translation yet. It undergoes three major modifications before leaving the nucleus.

First, a protective cap is added to the front end of the molecule. This happens early, typically after only about 20 building blocks have been assembled. The cap acts as a shield against degradation and later helps ribosomes recognize the message. Second, sections of non-coding sequence called introns are cut out, and the remaining coding sections (exons) are spliced together. Third, a tail of repeated adenine bases, called a poly(A) tail, is added to the back end. This tail helps the finished mRNA get exported from the nucleus and adds further stability.

These three steps are tightly linked. Research on human cells has shown that if the cap isn’t properly completed, the later steps of splicing and tail addition often fail. Properly capped messages proceed through the full processing pipeline, while defective ones are degraded. Only fully processed mRNA is exported to the cytoplasm for translation.

The Genetic Code: How RNA Encodes Proteins

The language connecting RNA to protein is the genetic code. Ribosomes read mRNA three bases at a time. Each three-base unit is called a codon, and each codon specifies a particular amino acid (the building blocks of proteins). With four possible bases in three positions, there are 64 possible codons, but they encode only about 20 standard amino acids. This means most amino acids are represented by more than one codon. Leucine, serine, and arginine each have six codons, for example, while methionine and tryptophan each have only one. Three of the 64 codons (UAA, UAG, and UGA) don’t code for any amino acid. Instead, they signal the ribosome to stop.

Translation: Building Proteins From RNA

Translation is divided into four phases: initiation, elongation, termination, and ribosome recycling. It takes place on ribosomes, which are molecular machines made of two subunits, one small and one large, that clamp together around the mRNA.

During initiation, the small ribosomal subunit, along with a set of helper proteins called initiation factors, attaches to the mRNA and scans along it until it finds a start codon (AUG). This codon marks the beginning of the protein-coding sequence. Once the start codon is recognized, the large ribosomal subunit joins, and the complete ribosome is ready to begin building.

How the Ribosome Adds Amino Acids

The ribosome has three internal sites where transfer RNA (tRNA) molecules pass through, labeled A, P, and E. Each tRNA carries a specific amino acid and has a three-base anticodon that matches a codon on the mRNA. A dedicated family of enzymes loads the correct amino acid onto each tRNA before it arrives at the ribosome. This loading step is critical: it’s the point where the nucleotide world of RNA is physically linked to the amino acid world of proteins.

Elongation works as a repeating cycle. A loaded tRNA enters the A (aminoacyl) site, where its anticodon pairs with the mRNA codon. If the match is correct, the ribosome catalyzes a bond between the amino acid in the A site and the growing protein chain held in the P (peptidyl) site. Then the ribosome shifts forward by one codon: the now-empty tRNA moves to the E (exit) site and leaves, the tRNA carrying the growing chain moves from A to P, and a fresh codon is exposed in the empty A site for the next tRNA. This cycle repeats for every codon in the message. In living cells, the ribosome adds roughly 5 to 20 amino acids per second, with speeds near 20 amino acids per second observed in vivo.

Stopping and Recycling

When the ribosome reaches one of the three stop codons, no matching tRNA exists to fill the A site. Instead, a release factor protein recognizes the stop codon, triggers the release of the finished protein chain, and the ribosome splits apart. The two subunits, the leftover tRNA, and the mRNA are all separated in a recycling process so they can be reused in future rounds of translation. Termination and recycling are tightly coordinated: the same machinery that splits the ribosome apart also speeds up the final release of the protein during termination.

How Accurate These Processes Are

Translation is remarkably precise, but not perfect. The ribosome makes an error roughly once every 1,000 to 10,000 codons, meaning it inserts the wrong amino acid at that frequency. For a typical protein of 300 to 500 amino acids, most copies are made correctly, but a small fraction contain a single substitution. The error rate varies depending on which codon is being read. Some codons are misread at rates as high as 1 in 100, while others are nearly error-free at 1 in 10,000. The ribosome uses a proofreading step to check whether the incoming tRNA truly matches the codon, rejecting most incorrect pairings before the amino acid is added to the chain.

From Gene to Working Protein

The full journey from gene to protein involves DNA being copied into pre-mRNA in the nucleus, that pre-mRNA being capped, spliced, and tailed to become mature mRNA, the mRNA traveling to the cytoplasm, and ribosomes reading it three bases at a time to assemble a chain of amino acids. That chain then folds into a three-dimensional shape, becoming a functional protein. A single mRNA molecule can be translated by multiple ribosomes simultaneously, producing many copies of the same protein in a short time. This is how your cells efficiently convert genetic information into the thousands of different proteins needed to keep you alive.