What Is Whole Genome Sequencing and How Is It Used?

Whole genome sequencing (WGS) is a laboratory technique that reads the entirety of a person’s DNA, all 3 billion base pairs of it, producing a comprehensive map of their genetic code. Unlike older methods that only examine small, pre-selected regions, WGS captures everything: the genes that code for proteins, the vast stretches of regulatory DNA between genes, and structural features that other tests miss entirely. The cost has plummeted from $1 million in 2007 to roughly $600 today, and it’s rapidly becoming a practical tool in medicine, cancer treatment, and newborn screening.

What WGS Actually Reads

Your DNA is made up of roughly 20,000 genes, but those genes account for only about 1% of your total genome. The other 99% was once dismissed as “junk DNA,” but scientists now know much of it plays a role in regulating when and how genes turn on, how cells develop, and how the body responds to disease. Whole genome sequencing reads all of it.

This is the key difference between WGS and its more targeted cousin, whole exome sequencing (WES). Exome sequencing reads only the protein-coding 1%, which does harbor about 85% of known disease-related variants. But WGS picks up what exome sequencing cannot: variants in the regulatory regions between genes, mutations buried inside introns (the non-coding segments within genes), and large structural rearrangements in chromosomes. Exome sequencing has very limited power to detect these structural variations, which are important in many diseases. In one comparison study, WGS identified significant genetic hits in intergenic regions that all three exome platforms completely missed.

How the Sequencing Process Works

The process starts with a simple blood draw or cheek swab. Once DNA is extracted and purified, it goes through several steps before a sequencer can read it.

First, the long strands of DNA are broken into millions of short fragments. Short adapter sequences are attached to the ends of each fragment, essentially giving the sequencing machine something to grip. The fragments are then amplified, creating many copies to work with. This collection of prepared fragments is called the “library.”

Next, the library is loaded onto a glass chip called a flow cell. Each fragment binds to the surface and gets copied thousands of times in a tiny cluster. The sequencing machine then reads each cluster one base at a time, using fluorescent chemical tags that light up in a different color for each of the four DNA letters (A, T, C, G). A laser detects the color at each step, and the machine records the sequence. Most platforms read fragments in lengths of 75 to 150 bases per pass.

After sequencing, software takes the millions of short reads and maps them back to a human reference genome, like assembling a jigsaw puzzle using the picture on the box. Algorithms then scan for places where your DNA differs from the reference, a process called variant calling. A single whole genome at standard clinical depth generates raw data files that can span up to 100 gigabytes.

Why Sequencing Depth Matters

Sequencing machines don’t read your genome just once. They read overlapping fragments many times over, and the average number of times each position gets read is called “depth” or “coverage.” Clinical WGS typically uses a mean depth of 30x to 50x, meaning each spot in your genome is read 30 to 50 times on average. Research shows that accuracy for detecting single-letter mutations and small insertions or deletions improves steadily with depth and plateaus around 40x. That 40x standard is also sufficient for identifying most larger copy number variations, where chunks of DNA are duplicated or deleted.

Diagnosing Rare and Undiagnosed Diseases

WGS has had the biggest clinical impact for patients stuck in what’s sometimes called a “diagnostic odyssey,” years of inconclusive tests for symptoms that don’t fit a clear pattern. For rare genetic diseases, WGS roughly doubles the diagnostic success rate compared to standard genetic testing. In one large study of 4,660 participants of all ages, 25% received a new diagnosis from WGS after standard tests had come up empty. Even among patients who had already undergone exome sequencing and gotten no answer, WGS still found a diagnosis in about 14.5% of cases, catching variants that exome sequencing is blind to.

These aren’t just academic findings. A confirmed genetic diagnosis can end years of uncertainty, connect families with the right specialists, identify targeted treatments, and inform decisions about having future children. In head-to-head comparisons, WGS confirmed significantly more diagnoses than conventional testing (41% versus 24%).

Cancer and Tumor Profiling

In oncology, WGS gives clinicians a complete picture of the mutations driving a tumor. Rather than running a panel that tests for a few dozen known cancer genes, WGS sequences the full genome of tumor cells and compares it to the patient’s normal DNA. This comprehensive approach catches structural rearrangements and gene fusions that targeted panels often miss.

One particularly useful measurement is tumor mutational burden (TMB), a count of how many mutations a tumor carries. A high TMB can predict whether a patient will respond to immunotherapy drugs called checkpoint inhibitors. In a large evaluation of England’s 100,000 Genomes Project, about 40% of cancer cases sequenced with WGS received a therapeutic recommendation, including clinical trial options, off-label therapies, or immunotherapy based on high TMB. Colorectal cancer patients were especially likely to have a high TMB (24% versus 10% overall), making WGS particularly informative for that group.

Predicting Drug Reactions

Your genes influence how your body metabolizes medications, and WGS captures all of the relevant variants in a single test. This application, called pharmacogenomics, uses your genetic data to predict whether a drug will work well for you, do nothing, or cause a dangerous reaction.

One well-studied example involves a liver enzyme responsible for breaking down a wide range of drugs. Genetic variation in the gene for this enzyme can produce outcomes ranging from no noticeable effect to a functional overdose at a standard dose. Clinical programs already use genomic data to guide dosing for blood thinners like warfarin and clopidogrel, matching each patient to the medication best suited for their metabolism. Pharmacogenomics is also proving useful in psychiatry, where antidepressants and antipsychotics are notoriously trial-and-error.

Because WGS captures your full genome, this pharmacogenomic information comes “for free” alongside any other analysis. A genome sequenced to diagnose a rare disease today can be re-analyzed years later when new drug-gene interactions are discovered.

Cost, Timeline, and Accessibility

The economics of WGS have shifted dramatically. The cost has dropped from about $1 million per genome in 2007 to roughly $600 today, and next-generation sequencing platforms are pushing toward a $200 genome with faster throughput. Clinical-grade results from a certified laboratory typically take about 28 days, though rapid sequencing protocols for critically ill newborns can deliver results in under a week at some centers.

Insurance coverage varies. In the United States, many insurers now cover WGS for specific clinical indications like undiagnosed rare disease, certain cancers, and critically ill infants. Consumer-grade sequencing, where healthy individuals get their genome sequenced out of curiosity or for proactive health information, is available from several companies but is generally paid out of pocket.

Limitations Worth Knowing

WGS reads your DNA with high accuracy, but reading the sequence and understanding it are two different things. A typical genome contains millions of variants compared to the reference, and for many of them, scientists simply don’t yet know whether they matter. These “variants of uncertain significance” can create anxiety without providing answers.

WGS also generates an enormous amount of data. A single genome’s raw files can reach 100 gigabytes, creating real challenges for storage, data security, and the computational infrastructure needed to analyze results. The sequencing itself is increasingly affordable, but the bioinformatics expertise required to interpret the data remains a bottleneck, especially for smaller hospitals and clinics.

Finally, WGS can uncover incidental findings, genetic risk factors for conditions you weren’t being tested for. Professional guidelines address how to handle these discoveries, but they can raise difficult personal and ethical questions about what you want to know and when.