Heterogeneous nuclear RNA (hnRNA) is the raw RNA transcript produced when a gene is copied from DNA inside the cell nucleus. It’s essentially the unprocessed, unedited version of messenger RNA (mRNA), containing both the coding sequences the cell needs and large stretches of non-coding material that will be cut out before the molecule can direct protein production. The term “heterogeneous” refers to the wide range of sizes these molecules come in, typically between 2,400 and 13,500 nucleotides long, which is two to six times larger than the finished mRNA they eventually become.
How hnRNA Relates to Pre-mRNA
You’ll often see hnRNA and pre-mRNA used interchangeably, and for most purposes they refer to the same thing: the initial transcript that hasn’t yet been processed into mature mRNA. The term hnRNA came first historically, coined in the 1960s when researchers discovered a population of large, varied RNA molecules confined to the nucleus. Scientists initially called this material “DNA-like RNA” because its base composition mirrored the cell’s DNA rather than the ribosomal RNA they were already familiar with. A few years later, in 1965, American researchers independently characterized the same molecules and gave them the name heterogeneous nuclear RNA.
The subtle distinction, when one is made, is that hnRNA is a broader descriptive term for any large RNA found in the nucleus, while pre-mRNA specifically refers to a transcript destined to become messenger RNA. In practice, the two overlap so heavily that most modern textbooks treat them as synonyms.
What hnRNA Contains
A freshly transcribed hnRNA molecule is a complete copy of a gene, including all its introns (non-coding segments) and exons (coding segments). It also carries sequences at both ends that will be modified during processing. The average size of these molecules is around 4,200 to 4,800 nucleotides, though individual transcripts vary enormously depending on the gene. Molecules carrying a poly(A) signal make up roughly 20% of total nuclear RNA mass.
hnRNA also contains regions of internal structure. About 1% of the nucleotides form stable double-stranded sections where complementary sequences fold back on themselves. An additional 1.5 to 2% form less stable paired regions. These internal structures aren’t random artifacts. They exist in the living cell and may serve as landing pads for regulatory proteins involved in processing.
Proteins That Protect and Shape hnRNA
The moment RNA polymerase II begins producing an hnRNA transcript, a family of proteins called heterogeneous nuclear ribonucleoproteins (hnRNPs) latches onto it. These proteins coat the naked RNA almost immediately, stabilizing it and preventing it from being degraded. The resulting complex of RNA and protein is sometimes called an hnRNP particle.
The hnRNP protein family is large, and its members do more than just protect the transcript. Different hnRNP proteins help control which sections of the RNA get cut out during splicing, stabilize the molecule during transport, and influence how quickly the finished mRNA is translated into protein once it reaches the cytoplasm. One member of the family, hnRNP K, carries special signal sequences that let it shuttle back and forth through the pores of the nuclear envelope, ferrying RNA cargo between the nucleus and cytoplasm.
Three Steps That Convert hnRNA to mRNA
Before hnRNA can leave the nucleus and guide protein production, it undergoes three major modifications. These steps happen in a coordinated sequence, often beginning while the transcript is still being made.
5′ Capping
The first modification happens fast. Once RNA polymerase II has produced the first 25 to 30 nucleotides of the transcript, enzymes add a protective chemical cap to the leading end. This cap is a modified guanine nucleotide attached in a reversed orientation. It shields the RNA from being chewed up by enzymes and later helps ribosomes recognize the molecule for translation.
Splicing
This is the most dramatic editing step. A large molecular machine called the spliceosome identifies each intron, loops it out into a lasso-shaped structure called a lariat, and precisely joins the flanking exons together. The process requires exact recognition of where each intron begins and ends, since even a single-nucleotide error would shift the reading frame and produce a nonfunctional protein. For genes with many introns, splicing can remove thousands of nucleotides from the original transcript.
Splicing also creates an opportunity for regulation. By including or skipping certain exons, cells can produce different protein variants from the same gene. This alternative splicing is one reason human cells can make far more proteins than they have genes.
3′ Polyadenylation
At the trailing end of the transcript, an enzyme cuts the RNA about 10 to 30 nucleotides downstream of a specific signal sequence (AAUAAA in human cells). A dedicated enzyme then adds a long tail of adenine nucleotides, typically 100 to 250 of them, to the freshly cut end. This poly(A) tail helps the finished mRNA remain stable in the cytoplasm and assists with export through the nuclear pore.
Why hnRNA Stays in the Nucleus
Only fully processed mRNA is normally exported to the cytoplasm. The cell has quality-control mechanisms that retain incompletely spliced or improperly processed transcripts inside the nucleus, where they are eventually degraded. This means much of the hnRNA produced never becomes functional mRNA. The portion that is destroyed during processing represents a significant fraction of the original transcript, since introns often make up 90% or more of a gene’s length.
What Happens When Processing Goes Wrong
Errors in hnRNA processing are linked to several serious diseases, particularly those affecting motor neurons. Spinal muscular atrophy (SMA) results from insufficient levels of a protein needed to assemble the splicing machinery. When the spliceosome can’t form properly, specific splicing events go awry, and motor neurons are especially vulnerable to the consequences.
Amyotrophic lateral sclerosis (ALS) provides another example. A protein called TDP-43, which normally binds to hnRNA at specific sequences near splice sites and recruits hnRNP proteins to regulate splicing, is found in abnormal clumps in the nerve cells of people with sporadic ALS and some forms of frontotemporal dementia. Mutations in TDP-43 that disrupt its ability to bind hnRNP proteins are predicted to reduce its splicing regulation, leading to the production of faulty transcripts. A related protein called FUS, which also associates with hnRNP proteins and participates in splicing, is mutated in some families with inherited ALS.
These examples underscore that hnRNA processing isn’t just a passive editing step. It’s a tightly regulated process, and the proteins that manage it are essential for normal cell function, particularly in neurons that depend on precise control of gene expression.

