A strain in microbiology is a genetic variant within a single species of microorganism. Bacteria, viruses, and fungi all contain many strains, each sharing the core identity of their species but carrying distinct genetic characteristics that set them apart from other members. These genetic differences can change how the organism behaves, how it responds to treatment, or whether it causes disease at all.
How a Strain Fits Into Microbial Classification
Microbiology organizes living things in a hierarchy: domain, kingdom, phylum, class, order, family, genus, species. A strain sits below the species level. Think of it this way: a species is like a breed of dog, and strains are the individual bloodlines within that breed. They’re all recognizably the same organism, but they differ in specific, measurable ways.
Those differences might involve genes that confer antibiotic resistance, surface proteins that the immune system recognizes, the ability to produce a toxin, or even just a minor metabolic quirk. Because strains within a species can behave so differently, microbiologists need a way to tell them apart, and much of modern microbiology revolves around doing exactly that.
Strains vs. Isolates vs. Serotypes
Three terms come up constantly in microbiology discussions, and they overlap just enough to cause confusion. An isolate is a microorganism that has been physically separated from a patient sample or environmental source and grown in the lab. It’s a single instance collected at a specific time and place. Once that isolate is characterized and shown to have distinct genetic or biochemical features, it can be designated a strain.
A serotype is a subcategory based on the molecules sitting on the organism’s outer surface. Microbiologists use three main surface markers for bacteria: cell wall antigens (called O antigens), flagellar antigens (H), and capsular antigens (K). Serotyping groups strains together by their immune-reactive surface profile, which is especially useful for tracking outbreaks. A single species might have dozens or hundreds of serotypes, and each of those serotypes may contain multiple individual strains.
Other classification tools used below the species level include phage type (which viruses can infect the strain), biotype (biochemical reaction patterns), and pathotype (whether the strain produces specific toxins or can invade certain tissues).
Why Strain Differences Matter
E. coli is the textbook example. The species lives harmlessly in the human gut, and the laboratory strain K-12 has been a workhorse of genetics research for decades. But E. coli O157:H7, a different strain of the very same species, causes severe food poisoning and can trigger kidney failure. Genetic analysis shows these two strains fall into entirely different evolutionary groups within the species. K-12 and the closely related B strain sit in group A, while O157:H7 belongs to group E.
The genomic differences are dramatic. O157:H7 carries pathogenicity islands, which are large chunks of DNA encoding toxins and invasion machinery, in locations where K-12 has completely different sequences. Researchers have even found evidence that the B strain may have once been a pathogen that lost its disease-causing genes over time, reverting to a harmless lifestyle. Mobile genetic elements called insertion sequences appear to have disrupted its virulence genes, effectively disarming it.
Antibiotic resistance also varies sharply between strains. Among Campylobacter strains, for instance, some carry a mutation in a key enzyme target plus an overactive drug-pumping system that together make the bacterium highly resistant to fluoroquinolone antibiotics. When researchers blocked the pump, resistance dropped back to levels seen in susceptible strains, even though the mutation was still present. Between 32% and 40% of tested strains showed resistance to certain common antibiotics, while only about 8% resisted ciprofloxacin. That range illustrates how unevenly resistance spreads across strains of a single species.
How Viral Strains Are Named
Viruses follow their own naming system. Influenza is the clearest example, with a naming convention maintained by the WHO and CDC. Each influenza strain name includes the virus type (A, B, C, or D), the host animal if it isn’t human, the geographic location where it was first identified, a sequence number, and the year of collection. For influenza A, two surface proteins are also specified: hemagglutinin (H) and neuraminidase (N). That’s where names like A(H1N1) and A(H3N2) come from.
So a full strain name like A/duck/Alberta/35/76 tells you it’s an influenza A virus, isolated from a duck in Alberta, catalog number 35, collected in 1976. A human strain like A/Perth/16/2019 drops the host designation entirely because human origin is the default. Influenza B viruses don’t have H and N subtypes but are split into two lineages, B/Yamagata and B/Victoria, each containing many individual strains.
How Scientists Identify Strains
Traditionally, microbiologists identified strains by growing them in the lab and observing their physical characteristics, biochemical reactions, and growth patterns. A strain might be distinguished by the sugars it can ferment, the enzymes it produces, or its shape under the microscope. These methods still work but can miss genetic differences that don’t show up as visible traits.
Whole genome sequencing has transformed strain identification. Modern pipelines extract a set of conserved housekeeping genes from a sample’s genome and compare them against reference databases. A metric called average nucleotide identity (ANI) calculates how similar two genomes are overall, allowing scientists to place an unknown organism precisely within the tree of life and determine whether it matches a known strain or represents something new. Sequencing of the 16S ribosomal RNA gene provides a quick initial identification, while full genome comparison gives the resolution needed to distinguish closely related strains.
Type Strains and Official Naming
When a new bacterial species is formally described, one specific strain is designated the “type strain.” This is the reference point for the entire species. Under the International Code of Nomenclature of Bacteria, the type strain should be a living culture deposited in a recognized culture collection where other researchers can access it. If the original author of a species name designated a type strain in their publication, that strain is accepted as the holotype. If the original type strain is lost, researchers can propose a replacement, called a neotype, by publishing a formal description and depositing the new strain in a permanent collection.
This system ensures that when two scientists in different countries refer to the same species, they’re talking about the same organism, anchored to a physical specimen anyone can examine and compare.
Strains in Biotechnology and Industry
Strain selection is the foundation of industrial microbiology. Common organisms like E. coli and baker’s yeast (Saccharomyces cerevisiae) are used to produce recombinant proteins and chemicals for industries ranging from pharmaceuticals to food processing, textiles, and biofuels. But not just any strain will do. The performance gap between an unoptimized wild-type strain and one that has been engineered or adapted for a specific task can be enormous.
A wild-type E. coli K-12 strain, for example, produces only 0.01 to 0.08 milligrams per liter of the amino acid lysine. Through targeted genetic editing of key regulatory genes in the lysine production pathway, combined with genome-wide optimization, researchers can push output orders of magnitude higher. Similar approaches have been used to engineer yeast strains that produce artemisinin (a malaria drug), 1,4-butanediol (an industrial chemical), and bakuchiol (a plant compound used in skincare as a retinol alternative).
Sometimes the challenge is tolerance rather than production. If the desired product is toxic to the microbe at high concentrations, engineers use adaptive laboratory evolution, growing the strain in gradually increasing concentrations of the toxic compound until it develops mutations that let it survive. The resulting adapted strain can then be sequenced to identify which mutations confer tolerance, and those changes can be introduced into production strains more precisely.

