Polyadenylation is the process by which a string of adenine nucleotides, called a poly(A) tail, is added to the end of a messenger RNA (mRNA) molecule after it’s copied from DNA. This tail, typically 150 to 250 nucleotides long when first added, acts as a protective cap on the trailing end of the mRNA. It shields the molecule from being broken down too quickly, helps it get translated into protein, and plays a role in moving it out of the nucleus. Nearly every mRNA in your cells goes through this step before it can do its job.
How the Poly(A) Tail Gets Added
Polyadenylation happens in two quick stages: the mRNA is first cut at a specific spot, then the tail is built onto the freshly cut end. Both stages are carried out by a team of protein complexes working together while the mRNA is still being copied from DNA.
The main player is a protein complex called CPSF, which has three functional modules. The first module recognizes the signal in the mRNA that says “cut here” (more on that signal below). The second module contains the molecular scissors, an enzyme called CPSF73, that actually slices the mRNA. A second complex called CstF assists by grabbing onto sequences just downstream of the cut site, helping lock everything in position.
Once CPSF73 makes the cut, two pieces result. The upstream piece is the mRNA that will become a finished message. The downstream piece, which lacks a protective cap, is quickly destroyed. An enzyme called poly(A) polymerase then latches onto the upstream piece and begins adding adenine nucleotides one at a time. A nuclear protein binds to the growing tail and helps the enzyme work rapidly until the tail reaches roughly 200 to 250 nucleotides. At that point, the process stops and the mRNA is ready for export.
The Signal That Triggers It
Cells don’t add poly(A) tails at random locations. A short sequence embedded in the mRNA, called the polyadenylation signal, marks the spot. The most common signal in human genes is the six-letter sequence AAUAAA, found about 10 to 30 nucleotides upstream of where the cut will be made. A second element, a GU-rich stretch, sits 20 to 40 nucleotides downstream of the cut site. Together, these two elements bracket the cleavage point and recruit the protein machinery.
AAUAAA was long thought to appear in about 90% of human mRNAs, but a large-scale analysis of human genes found it in closer to 58% of transcripts. The next most common variant, AUUAAA, accounts for about 15%. Beyond those two, at least ten other variant signals have been identified, collectively responsible for roughly another 15% of mRNA endings. These variants tend to differ at positions 1, 2, or 5 of the six-letter code, while positions 3, 4, and 6 are highly conserved. A mutation at one of those conserved positions can cripple the signal entirely.
Why the Tail Matters for mRNA Survival
The poly(A) tail is one of the main things keeping an mRNA alive inside the cell. Proteins called poly(A)-binding proteins (PABPs) coat the tail and form a bridge to the cap structure on the opposite end of the mRNA. This creates a loop, bringing the two ends of the molecule close together. That looped shape makes it much harder for the cell’s recycling enzymes to latch on and start chewing.
Without a poly(A) tail, translation is dramatically reduced. mRNAs with short tails of 15 to 30 nucleotides show notably low translation efficiency. As the tail gets longer, efficiency improves, and longer tails also help ribosomes assemble more effectively into the clusters (called polysomes) that mass-produce proteins from a single mRNA. Recent global analyses, however, suggest that many mRNAs in the cell actually carry tails shorter than the textbook range, often 50 to 100 nucleotides, meaning tail length is actively regulated rather than fixed.
Alternative Polyadenylation Changes the Message
Most genes contain more than one polyadenylation signal, and the cell doesn’t always use the same one. When the cleavage machinery recognizes a different signal than usual, the resulting mRNA ends up with a different 3′ end. This process, called alternative polyadenylation (APA), is one of the major ways cells generate diversity from a single gene.
The consequences depend on where the alternative signal sits. If both signals are located after the protein-coding region, the two mRNA versions encode the same protein but carry different lengths of 3′ untranslated region (3′ UTR). That might sound trivial, but the 3′ UTR is packed with regulatory sequences that attract microRNAs and RNA-binding proteins. A shorter 3′ UTR means fewer binding sites, which can make the mRNA more stable and more actively translated. A longer 3′ UTR can have the opposite effect, or it can direct the mRNA to a specific location within the cell.
If the alternative signal sits within an intron or upstream of the normal stop codon, the switch can actually change the protein itself, producing a truncated or altered version. This type of APA plays roles in cell identity and function during development and tissue differentiation. Interestingly, recent work suggests that processing can even happen sequentially: a strong downstream signal is recognized first, producing a long mRNA that is then retained and re-processed at a weaker upstream signal to generate a shorter isoform.
Polyadenylation in Bacteria Works in Reverse
Bacteria also add poly(A) tails to their mRNAs, but the effect is essentially opposite. In eukaryotic cells (human, animal, plant), the tail stabilizes mRNA and promotes translation. In bacteria, polyadenylation marks an mRNA for destruction. The added tail gives degradation enzymes a grip to begin breaking the molecule apart. This difference is a striking example of evolution repurposing the same chemical modification for completely different regulatory goals.
Polyadenylation Outside the Nucleus
Not all polyadenylation happens in the nucleus during mRNA production. In certain specialized cells, particularly egg cells (oocytes) during maturation, mRNAs that were stored with short or no poly(A) tails receive new tails in the cytoplasm. This cytoplasmic polyadenylation acts as an on-switch, activating stored mRNAs at precisely the right moment during early development.
During oocyte maturation, two classes of cytoplasmic polyadenylation occur at distinct times, linked to specific cell-cycle signals. This timing ensures that the right proteins are produced in the right order as the egg prepares for fertilization and the earliest cell divisions. The process doesn’t happen during egg formation itself but begins during maturation and continues through early embryonic development.
When Polyadenylation Goes Wrong
Because polyadenylation is essential for producing stable, functional mRNA, mutations in the polyadenylation signal can cause disease. The best-studied examples involve the genes for hemoglobin. A single-letter change in the polyadenylation signal of the HBA2 gene (which encodes part of the hemoglobin molecule) causes alpha-thalassemia, a blood disorder marked by reduced hemoglobin production. Cells carrying the mutation produce less HBA2 mRNA than normal cells.
A similar mutation in the HBB gene, where the signal AATAAA is changed to AACAAA, causes beta-thalassemia. In this case, the mutated signal fails to stop the mRNA properly. The transcription machinery reads past the normal endpoint and eventually uses a backup signal further downstream, producing an abnormal, elongated transcript. The result is a sharp drop in normal hemoglobin mRNA, leading to the clinical features of thalassemia. This particular variant has been found at frequencies ranging from 0.4% in the UK to over 4% in Guadeloupe. Polyadenylation signal mutations have also been linked to increased risk of certain skin cancers through effects on the tumor-suppressor gene TP53.
Tails Are Not Pure Adenine
The textbook picture of the poly(A) tail as a uniform string of adenine residues turns out to be an oversimplification. Specific enzymes can insert cytosine, guanine, or uracil nucleotides into the tail, creating a more heterogeneous structure. These non-adenine additions affect how quickly the tail is shortened and, by extension, how long the mRNA survives. Direct RNA sequencing using long-read technology has made it possible to measure individual tail lengths and compositions across thousands of transcripts at once, revealing that tail dynamics shift significantly during disease states, including viral infections like COVID-19 where widespread changes in poly(A) tail length have been observed across the blood transcriptome.

