What Is DNA Fingerprinting and How Does It Work?

A DNA fingerprint is a unique pattern of genetic markers that can identify a specific person with near-absolute certainty. Just as no two people share the same traditional fingerprint, virtually no two people (except identical twins) share the same DNA profile. The technique works by analyzing sections of DNA that vary widely from person to person, then producing a pattern that serves as a biological ID card.

How DNA Fingerprinting Works

Your DNA is about 99.9% identical to every other human’s. The power of DNA fingerprinting lies in that remaining 0.1%, specifically in regions called short tandem repeats, or STRs. These are short sequences of DNA, just one to six chemical “letters” long, that repeat themselves over and over in a stuttering pattern. One person might have a sequence that repeats eight times at a particular spot on their genome, while another person has it repeating twelve times. Most STRs sit in noncoding regions of DNA, stretches that don’t contain instructions for building proteins but vary enormously between individuals.

By measuring the number of repeats at multiple locations across the genome, analysts can build a profile that’s essentially unique. The FBI’s Combined DNA Index System (CODIS) currently requires analysis of 20 specific STR locations. The original standard, set in the early 2000s, used 13 locations, but seven more were added in January 2017 to make profiles even more precise. Checking more locations dramatically shrinks the odds that two unrelated people could share the same profile. With just two STR locations, a match might occur in roughly 1 in 644 people. Add two more and that drops to about 1 in 414,000. A full 20-location profile makes a coincidental match vanishingly unlikely.

From Crime Scene to Profile

DNA fingerprinting starts with a biological sample. Blood, saliva, semen, bone, teeth, or hair with the root still attached all contain nucleated cells that carry a person’s full genome. Even a tiny trace can be enough, though degraded or contaminated samples sometimes require more material to produce a clear result.

Once DNA is extracted from the sample, the lab uses a process called PCR (polymerase chain reaction) to make millions of copies of the specific STR regions being tested. PCR works through repeated heating and cooling cycles. First, the double-stranded DNA is heated to separate it into two single strands. Then the temperature drops so that short “primer” sequences can latch onto the target regions. Finally, an enzyme reads each strand and builds a new complementary copy. Each cycle doubles the amount of target DNA, so after 30 or so cycles, even a minuscule sample yields enough material to analyze.

The copied fragments are then sorted by size, traditionally using a technique called capillary electrophoresis, where an electric current pulls DNA through a thin tube filled with gel. Shorter fragments move faster than longer ones, so the fragments separate into distinct bands or peaks. The result is a visual profile showing how many repeats a person carries at each STR location. Because you inherit one copy from each parent, each location shows two values (one maternal, one paternal), which together form your unique pattern.

The Origins of DNA Fingerprinting

British geneticist Alec Jeffreys developed the technique in 1984 at the University of Leicester. He and his colleagues found a way to use naturally occurring variation between individuals, specifically differences in fragment lengths created by repeating DNA sequences, to distinguish one person from another. The method got its first major real-world test in 1986, when police asked Jeffreys to help identify a man who had raped and murdered two girls in the English Midlands. The case, which led to the conviction of Colin Pitchfork, proved that DNA evidence could both identify a guilty person and, just as importantly, exonerate an innocent one. A man who had falsely confessed was cleared by the DNA analysis before Pitchfork was caught.

Forensic Identification

Criminal investigations remain the most widely known application. When biological evidence is recovered from a crime scene, analysts generate a DNA profile and compare it against profiles in databases like CODIS. The FBI’s analysis of its own database found no exactly matching five-locus profiles among millions of pairwise comparisons. The closest coincidence was a single three-locus match out of 7.6 million comparisons, underscoring how rare even partial overlaps are. With today’s 20-locus standard, a reported match is extremely strong evidence that two samples came from the same person.

That said, the statistical weight of a match depends on how calculations are performed. In one notable Manhattan murder case, frequency estimates for the same DNA match ranged from 1 in 500 to 1 in 739 billion depending on the statistical method used. Courts and forensic labs have since standardized their approaches, but the episode illustrates why the math behind a match matters as much as the match itself.

Paternity and Family Testing

DNA fingerprinting is the gold standard for establishing biological relationships. In a paternity test, the child’s profile is compared to the mother’s and the alleged father’s. At every STR location, the child must have one value that matches the mother and one that matches the biological father. If the alleged father’s profile doesn’t supply the missing values, he’s excluded. If it does match at every location tested, the probability of paternity can be calculated, typically reaching 99.99% or higher.

The same principle extends to other family relationships. Siblings share more STR values than unrelated people, half-siblings share fewer than full siblings, and so on. Geneticists quantify this using a “kinship coefficient,” which reflects the proportion of DNA two relatives are expected to share. Parent and child or two full siblings share the most (a coefficient of 1/4), half-siblings or an uncle and nephew share half as much (1/8), first cousins share 1/16, and second cousins just 1/64. By checking enough STR locations, analysts can distinguish between these relationships with high confidence.

Monitoring Bone Marrow Transplants

One lesser-known use of DNA fingerprinting is tracking the success of bone marrow or stem cell transplants. After a transplant, the patient’s blood-forming cells should gradually be replaced by the donor’s cells. By comparing the DNA profile in blood or bone marrow samples to the known profiles of both the donor and the recipient, doctors can measure what proportion of cells belong to each person.

If only donor DNA is detected, that’s called complete chimerism, meaning the transplant has fully engrafted. If both donor and recipient DNA are present, it’s mixed chimerism, which can be normal early on but may signal trouble if it persists or worsens. Rising levels of the recipient’s own DNA can be an early warning of relapse or graft failure, often detectable before clinical symptoms appear. This monitoring typically begins within the first 30 days after transplant and continues at regular intervals, giving doctors a window to intervene with adjusted treatment if the graft isn’t taking hold.

How Technology Has Evolved

Jeffreys’ original 1984 method used a technique called restriction fragment length polymorphism (RFLP), which required relatively large, high-quality DNA samples and took weeks to produce results. The introduction of PCR in the late 1980s was transformative, allowing analysts to work with far smaller and more degraded samples by amplifying the DNA before analysis.

The current frontier is next-generation sequencing (NGS), which can read millions of DNA fragments simultaneously rather than one at a time. For forensic work, NGS offers several advantages over traditional methods. It can extract more information from the same STR locations by reading the actual sequence of each repeat, not just its length. Two fragments that look identical on a traditional size-based analysis might turn out to have different internal sequences, helping analysts tease apart complex mixtures where DNA from multiple people is present. NGS platforms can also analyze many more genetic markers in a single run, which is particularly useful for degraded samples where some markers may fail to produce results with older methods.

These advances haven’t fully replaced traditional capillary electrophoresis in most forensic labs, largely because of the cost of new equipment and the need to maintain compatibility with existing databases built on the 20 CODIS core loci. But the technology is steadily moving into routine use, particularly for difficult cases involving trace or mixed samples.

What DNA Fingerprints Can and Cannot Reveal

A standard forensic DNA profile is built entirely from noncoding STR regions. It doesn’t reveal your medical history, disease risk, physical traits, or ancestry in any meaningful detail. It’s essentially a string of numbers, the repeat counts at 20 locations, that functions purely as an identifier. This is by design: forensic databases were built around markers chosen specifically because they carry no known health information, minimizing privacy concerns.

The technique also has practical limits. Identical twins share the same STR profile, so DNA fingerprinting alone cannot distinguish between them. Severely degraded samples, such as those exposed to heat, chemicals, or bacteria for extended periods, may not yield a complete profile. And while the odds of a coincidental match between unrelated people are astronomically small, they aren’t zero, which is why DNA evidence is presented as a statistical probability rather than an absolute certainty.