DNA is the molecule that stores and transmits hereditary information in nearly all living organisms. It holds the instructions your cells need to build proteins, maintain themselves, and pass traits from one generation to the next. The entire system works through a precise chemical code, physical packaging inside cells, and a reliable copying process that keeps genetic information intact over time.
How DNA Stores Genetic Information
A DNA molecule consists of two long chains wound around each other in the familiar double helix shape. Each chain is built from four types of smaller units called nucleotides, and the only thing that differs between them is the chemical base they carry: adenine (A), cytosine (C), guanine (G), or thymine (T). These four bases function like letters in a four-letter alphabet. The specific order they appear in along a strand of DNA spells out biological instructions, much the way a specific sequence of letters spells out words and sentences.
The two chains are held together by pairing rules: A always pairs with T, and C always pairs with G. This complementary pairing is what allows the molecule to be copied accurately and what gives the double helix its structural stability. The sugar and phosphate portions of each nucleotide link together to form a sturdy backbone, while the paired bases sit in the interior like rungs on a twisted ladder.
A gene is a stretch of this DNA sequence that contains the instructions for building one protein (or sometimes a few related ones). The human genome contains roughly 20,000 to 25,000 protein-coding genes spread across 23 pairs of chromosomes, with each chromosome averaging over 100 million base pairs of DNA.
Packaging DNA Inside the Cell
If you stretched out all the DNA from a single human cell, it would measure about two meters long. Fitting that much material into a microscopic nucleus requires an elaborate packing system. Proteins called histones do the heavy lifting. DNA carries a negative electrical charge along its backbone, and histones carry a positive charge, so the two bind tightly together. The DNA wraps around clusters of eight histone proteins to form small spools called nucleosomes, each holding about 166 base pairs of DNA. Under an electron microscope, a chain of nucleosomes looks like beads on a string.
These nucleosome chains coil further into thicker fibers, which loop and fold again until they form the compact structures we recognize as chromosomes. This layered compaction isn’t just about saving space. It also helps regulate which genes are active at any given time, because tightly packed regions are harder for the cell’s machinery to read than loosely packed ones.
From DNA to Protein
Storing information is only useful if the cell can read it. The process of converting a gene’s instructions into a working protein happens in two major steps, often summarized as the “central dogma” of molecular biology: information flows from DNA to RNA to protein.
In the first step, called transcription, the cell makes a single-stranded copy of a gene using a molecule called messenger RNA (mRNA). An enzyme travels along the DNA strand, reading the base sequence and assembling a complementary RNA strand. Once this mRNA is processed and finalized, it exits the nucleus and heads to a ribosome, the cell’s protein-building machinery.
In the second step, called translation, the ribosome reads the mRNA three bases at a time. Each three-base group, known as a codon, specifies one particular amino acid. Small adapter molecules called transfer RNA (tRNA) ferry the correct amino acids to the ribosome, matching each codon on the mRNA. The ribosome links these amino acids together in order, producing a chain that folds into a functional protein. Proteins then carry out the vast majority of tasks in the body, from giving your hair its texture to breaking down the food you eat. This is how a sequence of DNA bases ultimately becomes a visible trait.
How DNA Copies Itself
Every time a cell divides, it first duplicates its entire DNA so each new cell receives a complete set of instructions. The two strands of the double helix separate, and specialized enzymes called DNA polymerases build a new complementary strand along each original one. Because A pairs with T and C pairs with G, the process produces two identical double helixes from one original.
Accuracy matters enormously here. Cells use at least three overlapping safeguards to keep errors to a minimum. First, DNA polymerases are highly selective about which nucleotide they insert. Second, a built-in proofreading function lets the enzyme back up and remove a mismatched base immediately after placing it. Third, a post-replication repair system scans newly made DNA for any errors that slipped through the first two checks. Together, these mechanisms make DNA replication remarkably faithful, though rare mistakes (mutations) do occur and are one source of genetic variation.
Passing Genes to the Next Generation
Hereditary information travels from parent to offspring through specialized reproductive cells called gametes (sperm and eggs in humans). These cells are produced by a type of cell division called meiosis, which differs from ordinary cell division in two important ways.
First, meiosis cuts the chromosome number in half. Human body cells carry 46 chromosomes (23 pairs), but each gamete ends up with just 23. When a sperm and egg fuse at fertilization, the resulting embryo has the full 46 again. Without this reduction, chromosome numbers would double with every generation.
Second, meiosis shuffles genetic material. Before the cell divides, matching chromosomes from your mother and father line up side by side and physically swap segments of DNA in a process called recombination or crossing over. This means each gamete carries a unique combination of alleles, which is why siblings (other than identical twins) look different from one another despite having the same two parents. The random assortment of chromosomes during division adds even more variety. These mechanisms together ensure that every offspring inherits a novel mix of hereditary information.
Epigenetics: Information Beyond the DNA Sequence
The DNA sequence itself isn’t the only form of heritable information. Chemical tags can be added to DNA or to the histone proteins it wraps around, changing how the cell reads certain genes without altering the underlying code. The most studied of these modifications is DNA methylation, where a small chemical group attaches to specific bases. Methylation typically silences a gene, reducing or stopping its protein production, while removal of the tag can reactivate it.
These epigenetic changes can be influenced by diet, stress, chemical exposures, and other environmental factors. Some epigenetic patterns are passed along when cells divide, and certain modifications may even be transmitted from parent to child. Unlike mutations, epigenetic changes are reversible, giving cells a flexible layer of control over gene activity on top of the permanent genetic code.
When RNA Stores Hereditary Information
DNA is the standard storage molecule across plants, animals, fungi, and bacteria, but it isn’t universal. Some viruses use RNA instead. Retroviruses, for example, carry two copies of a single-stranded RNA genome inside their protein shell. After a retrovirus enters a host cell, an enzyme called reverse transcriptase converts that RNA into DNA, which then integrates into the host’s own chromosomes. Once embedded, the viral DNA is copied along with the host’s genes every time the cell divides. This is a reversal of the usual information flow, running from RNA back to DNA, and it shows that the storage and transmission of hereditary information can take more than one molecular form.

