Why Is DNA Sequencing Important in Medicine and Beyond

DNA sequencing matters because it lets us read the precise order of chemical letters in a genome, turning biological information into data we can act on. That capability has reshaped medicine, agriculture, criminal justice, and our understanding of human origins. The first human genome cost roughly $2.7 billion and took 13 years to complete. Today, a whole genome sequence costs well under $1,500, and that plummeting price has opened the door to applications that touch nearly every part of modern life.

Diagnosing Diseases That Stump Doctors

Roughly 300 million people worldwide live with a rare disease, and many spend years bouncing between specialists without a clear diagnosis. Genome sequencing has become one of the most powerful tools for ending that search. In a large NIH-funded study of 822 families with undiagnosed conditions, sequencing the full genome produced a molecular diagnosis in about 29% of cases. That may sound modest, but these were families who had already been through extensive prior genetic testing without answers.

What’s especially striking is that about 8% of all families in the study carried genetic variants that could only be detected through full genome sequencing, not the older, more targeted methods. These included mutations buried deep in non-coding regions of DNA, structural rearrangements, and repeat expansions that standard tests simply miss. For a family that has spent years in diagnostic limbo, that 8% represents the difference between uncertainty and a name for what’s wrong.

Matching Cancer Patients to the Right Drug

Cancer is fundamentally a disease of DNA. Tumors arise from mutations, and sequencing those mutations lets oncologists choose treatments that target the specific molecular flaw driving a patient’s cancer rather than relying on broad chemotherapy alone.

Some of the clearest success stories illustrate this well. A subset of breast cancers carry extra copies of a growth-signaling gene called HER2; identifying that amplification through sequencing directs patients to a targeted therapy that blocks HER2’s overactivity. In lung cancer, certain mutations cause a growth receptor to stay permanently switched on. Sequencing the tumor reveals those mutations and opens the door to drugs designed to shut that receptor down. Patients with a specific gene fusion in their lung tumors respond well to a different class of targeted therapy entirely. In leukemia, a characteristic chromosomal rearrangement produces an abnormal protein, and the drug designed to inhibit it became one of the first major successes of precision oncology.

Without sequencing, clinicians would have no reliable way to know which molecular subtype a patient has, and many of these targeted drugs simply wouldn’t work on the wrong subtype.

Predicting How Your Body Handles Medication

The same drug at the same dose can work perfectly in one person and cause dangerous side effects in another. Much of that variation comes down to genetics, and sequencing can reveal it in advance.

A single gene involved in drug metabolism affects how your body processes roughly 20% of commonly prescribed medications, from antidepressants to pain relievers to heart drugs. If you carry variants that make the protein encoded by that gene work too fast or too slow, standard doses may be ineffective or toxic. Sequencing identifies where you fall on that spectrum before you ever take the pill.

The stakes can be even higher with certain medications. People with a specific variant in an immune-system gene face a severe, potentially life-threatening allergic reaction to a common HIV drug. A simple genetic test avoids that entirely. In one documented case, a patient’s sequencing results revealed two rare variants that slowed her metabolism of a chemotherapy drug. Without that information, she would have received a dose her body couldn’t safely clear.

Tracking Outbreaks in Real Time

During the COVID-19 pandemic, genomic sequencing became a frontline public health tool. By reading the genetic code of virus samples as they were collected, scientists could determine how many times a virus had been introduced into a country, trace which cases were linked to one another, and monitor the emergence of new variants.

New Zealand’s response demonstrated this especially well. When new COVID-19 clusters appeared after the country had eliminated community transmission, rapid genome sequencing of positive samples confirmed that cases within an outbreak shared a single genetic lineage, meaning they stemmed from one introduction event rather than multiple undetected chains. That information gave public health teams the confidence to focus their contact tracing efforts tightly, helping the country eliminate community spread a second time. The same approach now underpins ongoing surveillance of influenza, tuberculosis, and foodborne pathogens, giving epidemiologists a resolution that traditional lab methods can’t match.

Rewriting the Story of Human Origins

Sequencing ancient DNA extracted from fossils has fundamentally changed what we know about human evolution. Before ancient genomics, scientists relied on bone shape and archaeological context to reconstruct our past. Now, DNA from specimens tens of thousands of years old tells a far more detailed story.

The most dramatic revelation is that early modern humans interbred with Neanderthals and another archaic group called Denisovans. People of East Asian ancestry carry slightly more Neanderthal DNA than people of European ancestry, a finding that was initially surprising and suggests the mixing happened more than once, or that some populations later diluted their Neanderthal heritage through interbreeding with groups that hadn’t encountered Neanderthals. Even some present-day African genomes carry genetic signatures of mixing with archaic human groups that have never been identified from fossils. Without sequencing, none of this would be visible.

Building Better Crops

Feeding a growing population on a warming planet requires crops that produce more grain, resist disease, and tolerate drought and salt. Sequencing crop genomes accelerates all of those goals by identifying the specific genes responsible for desirable traits.

In rice, one of the world’s most important food crops, genome data helped researchers pinpoint a gene that controls grain number per plant. They later identified a related gene that regulates both that grain-production gene and the plant’s tolerance to drought and salt, linking yield and climate resilience at the molecular level. Once scientists know which genes matter, they can move those traits into new crop varieties through targeted breeding or genetic modification rather than decades of trial-and-error crossbreeding. The re-sequencing of 3,000 rice genomes is generating a massive dataset aimed at developing varieties with higher yields, better nutrition, and stronger resistance to the pests, diseases, and environmental stresses common in tropical agriculture.

Solving Cold Cases Through Genetic Genealogy

In April 2018, investigators used DNA sequencing and public genealogy databases to identify the Golden State Killer, a serial offender who had evaded capture for decades. That arrest introduced forensic genetic genealogy to the public, and since then, hundreds of cold criminal cases and unidentified remains cases in the United States have been resolved using the same approach. The technique works by sequencing DNA from crime scene evidence, uploading the genetic profile to genealogy databases, and building out family trees until investigators can narrow the pool to a single suspect. It has proven especially valuable in cases where traditional DNA database searches returned no matches.

Estimating Your Risk for Common Diseases

Most common diseases, including heart disease, diabetes, and many cancers, aren’t caused by a single gene. They result from the combined influence of many small genetic variations, each nudging risk up or down slightly. Polygenic risk scores aggregate those variations into a single number that reflects how your genetic profile compares to others.

For coronary artery disease, researchers have identified about 60 genomic variants that appear more frequently in affected individuals. A polygenic risk score based on those variants places you on a bell curve: most people land in the middle with average risk, while those on the tails face meaningfully higher or lower risk. The scores don’t predict when or whether you’ll develop a disease. A 22-year-old and a 98-year-old with identical scores will have very different lifetime risks. But for someone who falls in the high-risk tail, that information can prompt earlier screening, lifestyle changes, or closer monitoring that might not have happened otherwise.

Two Sequencing Technologies, Different Strengths

Modern sequencing generally falls into two camps. Short-read platforms chop DNA into small fragments and read millions of them simultaneously with high accuracy. They remain the workhorse for most clinical and research applications. Long-read platforms read much longer stretches of DNA in a single pass, which makes them better at assembling complete genomes and detecting large structural changes that short reads can miss.

Recent comparisons of the two approaches for microbial pathogen work found that long-read assemblies were more complete and, when processed with the right computational tools, produced variant calls just as accurate as short reads. That’s significant because long-read devices can be small and portable, making them useful for sequencing in the field during outbreaks rather than shipping samples to a central lab. In practice, many research groups now use both technologies, choosing the one that best fits the question they’re trying to answer.