What Is Sequencing? DNA, RNA, and How It Works

Sequencing is the process of reading the exact order of chemical building blocks in a DNA or RNA molecule. Every cell in your body contains DNA made up of four bases: adenine (A), thymine (T), cytosine (C), and guanine (G). These four letters, arranged in specific combinations, form the instructions for everything from eye color to how your immune system fights infection. Sequencing technology reveals that precise arrangement, letter by letter.

How DNA Sequencing Works

DNA is structured as a double helix, two strands wound around each other. The bases on each strand pair in a predictable way: A always bonds with T, and C always bonds with G. This reliable pairing is what makes sequencing possible. Most sequencing methods exploit this rule by watching new bases pair up with a template strand one at a time, recording each addition to build a readout of the sequence.

The general workflow starts with extracting DNA from a sample (blood, saliva, tumor tissue, or virtually any biological material) and breaking it into smaller fragments. Those fragments are then tagged with short molecular labels called adapters, which let the sequencing machine grab onto them. The tagged fragments are copied many times over to strengthen the signal, and finally the machine reads the bases in each fragment. Software then stitches all the short reads back together, like assembling a jigsaw puzzle, to reconstruct the full sequence.

Types of Sequencing Technology

The first widely used method, developed in the 1970s, is Sanger sequencing. It reads one stretch of DNA at a time, typically around 650 bases per read. Sanger sequencing remains so accurate that it is still considered the gold standard for confirming individual results. But reading an entire human genome this way would be extraordinarily slow and expensive.

Next-generation sequencing (NGS) changed the field by reading millions of DNA fragments simultaneously. Instead of processing one fragment, then the next, NGS runs millions of reactions in parallel on a single chip. This massively parallel approach made it practical to sequence a whole human genome in days rather than years, and at a fraction of the cost. Platforms from companies like Illumina dominate clinical and research labs today.

A newer wave, sometimes called third-generation sequencing, reads single molecules of DNA in real time without needing to copy them first. Platforms from Pacific Biosciences and Oxford Nanopore Technologies can read extremely long stretches of DNA in one pass, tens of thousands of bases or more, compared to the few hundred bases typical of NGS. Long reads make it easier to map complex or repetitive regions of the genome that short reads struggle with. Oxford Nanopore’s devices are small enough to fit in your hand, opening the door to sequencing in remote clinics or even in the field.

Sequencing RNA, Not Just DNA

DNA contains all the instructions an organism could ever need, but not every gene is active at every moment. Cells selectively copy certain genes into RNA molecules, which then guide the production of proteins. The complete set of RNA molecules in a cell at a given time is called the transcriptome, and it reflects what the cell is actually doing rather than what it could theoretically do.

RNA sequencing (RNA-Seq) captures this snapshot. Researchers convert the RNA back into DNA, then sequence it using the same high-throughput platforms. The result reveals which genes are turned on or off, how strongly each gene is expressed, and whether any genes are being spliced into unexpected forms. This has been transformative for understanding diseases like cancer, where gene activity in a tumor cell looks dramatically different from a healthy cell.

Single-Cell Sequencing

Traditional sequencing processes thousands or millions of cells at once, blending their signals into an average. That average can hide critical differences. A tumor, for example, may contain dozens of genetically distinct cell populations, some of which resist treatment while others don’t. Single-cell sequencing isolates individual cells and reads each one separately, revealing the diversity within a tissue that bulk methods miss.

This approach has reshaped how researchers study development, immunity, and disease. By analyzing thousands of individual cells from one tissue sample, scientists can identify rare cell types, track how cells change over time, and pinpoint which specific cells drive a disease process.

Clinical Uses

Sequencing has moved well beyond the research lab. One of its most impactful clinical roles is diagnosing rare genetic diseases. Many patients with unexplained symptoms spend years cycling through specialists. Sequencing their genome or its protein-coding regions (called the exome) can identify the responsible mutation in a single test. In one notable case, exome sequencing revealed that a child diagnosed with a Crohn’s-like inflammatory bowel disease actually had a specific immune deficiency, completely changing the treatment approach.

Prenatal screening is another major application. By sequencing fragments of fetal DNA circulating in a pregnant person’s blood, clinicians can detect chromosomal abnormalities like Down syndrome without an invasive procedure such as amniocentesis.

In cardiology, sequencing has identified mutations responsible for inherited conditions like dilated cardiomyopathy and familial aortic aneurysms. For families with a history of heart disease that doesn’t follow obvious patterns, sequencing can uncover the genetic thread connecting affected relatives and guide screening for at-risk family members.

Cost and Turnaround Time

The cost of sequencing a human genome has plummeted. The original Human Genome Project, completed in 2003, cost roughly $2.7 billion. Today, a full human genome can be sequenced for as little as $475. That dramatic drop, far faster than Moore’s Law in computing, has made genomic testing accessible to ordinary clinical care rather than limiting it to elite research centers.

In a clinical setting, the turnaround from collecting a sample to receiving results averages about 18 days, though it can range from 5 to 43 days depending on the lab and the complexity of the case. Roughly 4 of those days are typically spent just getting the sample to the performing laboratory, with the remaining time split between the actual sequencing run and the computational analysis that follows.

From Raw Data to Usable Results

A sequencing machine doesn’t hand you a neatly annotated report. It produces enormous files of short DNA reads, billions of base pairs that need to be computationally assembled, compared to a reference genome, and checked for meaningful differences. This analysis pipeline typically involves aligning each read to its correct location in the genome, then scanning for variants: places where the patient’s DNA differs from the reference.

Those variants are classified as known disease-causing mutations, benign differences, or uncertain findings that may require further investigation. For cancer samples, the pipeline also looks for larger structural changes like deleted or duplicated sections of chromosomes. The entire process demands significant computing power and specialized expertise in bioinformatics, which is one reason sequencing results take days or weeks rather than hours.

The gap between generating raw sequence data and interpreting what it means for a patient remains one of the biggest challenges in genomics. The sequencing itself has become fast and cheap. Making sense of the results is where the real complexity lies.