More than five years after the first cases appeared, the origin of COVID-19 remains one of the most consequential unsolved questions in modern science. What is known: a cluster of unusual pneumonia cases emerged in Wuhan, China, in December 2019, and the virus spread globally within weeks. What is still debated: whether the virus jumped from an animal to a human in a natural spillover event, or whether it escaped from a laboratory studying bat coronaviruses in the same city.
The First Known Cases
On December 12, 2019, a cluster of patients in Wuhan began experiencing symptoms of an unusual pneumonia that didn’t respond well to standard treatments. By December 31, China’s health authorities had notified the World Health Organization of multiple cases of pneumonia with an unknown cause, marked by shortness of breath and fever. All of the initial cases appeared connected to the Huanan Seafood Wholesale Market, a large indoor market that also sold live wildlife.
That link to the Huanan market became a central piece of evidence in the investigation. Spatial analysis of the earliest cases showed a high concentration of virus-positive environmental samples in the corner of the market where most live wildlife was sold. Genetic traces of SARS-CoV-2 and animals were found in the same stalls and in drains directly below them. Raccoon dogs, which are susceptible to the virus and can transmit it to other animals, were among the species sold at those stalls.
The Natural Spillover Hypothesis
The leading natural-origin explanation follows a familiar pattern: a bat coronavirus infected an intermediate animal, which then passed a mutated version of the virus to humans, likely at the Huanan market. This is roughly how the original SARS outbreak started in 2002, when a bat virus jumped to civet cats sold at live-animal markets in southern China before infecting people.
Several lines of evidence support this scenario. SARS-CoV-2’s closest known relatives are bat coronaviruses found in caves in southern China and Southeast Asia, including one called RaTG13 that shares about 96% of its genome with SARS-CoV-2. More recently discovered bat viruses from Laos are even closer matches in key parts of the genome, with one recombination analysis showing a possible 98.7% similarity to the SARS-CoV-2 spike protein.
Pangolins emerged as a leading candidate for the intermediate host. Coronaviruses found in Malayan pangolins share more genetic similarity with SARS-CoV-2 than bat coronaviruses do in one critical region: the part of the virus that latches onto human cells. Raccoon dogs are another candidate, since lab experiments confirmed they can catch the virus and spread it to other animals. No one has yet found the direct ancestor of SARS-CoV-2 in any animal, though. That missing link is a significant gap, though it took over a decade to trace the original SARS virus back to its bat reservoir.
Genetic analysis of the earliest viral samples also revealed two distinct lineages, called A and B, circulating in Wuhan at the same time. Lineage B dominated the Huanan market cases, while the earliest lineage A viruses were linked to a different wildlife market in the city. Some researchers argue that the existence of two lineages suggests at least two separate spillover events from animals to humans, which would make a lab origin less likely.
The Lab Leak Hypothesis
The Wuhan Institute of Virology, one of China’s top research labs for bat coronaviruses, is located roughly 10 miles from the Huanan market. The lab had been collecting and studying bat coronaviruses for years, including gain-of-function research that engineered chimeric viruses (combining elements from different viruses to study how they might evolve). Starting in at least 2016, researchers there conducted experiments involving RaTG13, the closest known bat relative of SARS-CoV-2.
A January 2021 U.S. State Department fact sheet disclosed that several researchers at the institute became sick in autumn 2019, before the first officially identified case of the outbreak, with symptoms consistent with both COVID-19 and common seasonal illnesses. The lab also engaged in classified research on behalf of the Chinese military since at least 2017, raising questions about transparency.
One piece of genetic evidence has fueled particular debate. SARS-CoV-2 contains a 12-nucleotide insertion that creates what’s called a furin cleavage site, a feature that helps the virus enter human cells efficiently. No other virus in its family of beta-coronaviruses carries this exact insertion. As one Caltech analysis framed it: the sequence could have jumped from a distantly related virus and evolved naturally, or someone could have inserted it in a lab. Looking at the genome alone, you cannot distinguish between the two explanations.
Why the Question Remains Unresolved
The U.S. intelligence community has investigated the question extensively and remains divided. A 2023 assessment from the Office of the Director of National Intelligence found that all agencies consider both hypotheses plausible. Four intelligence agencies and the National Intelligence Council lean toward a natural animal origin, but only with low confidence. One agency favors a lab-associated incident with moderate confidence. Three agencies couldn’t settle on either explanation.
The disagreements stem largely from how different analysts weigh the available intelligence against the scientific publications, and from significant gaps in both. China has not granted independent investigators full access to the Wuhan Institute of Virology’s databases, lab records, or early patient samples. The institute took a key database of bat virus sequences offline in September 2019 and has not restored public access. Without that data, and without finding the direct animal ancestor of SARS-CoV-2 in the wild, neither hypothesis can be confirmed or ruled out.
What makes the question so difficult is that both explanations are genuinely consistent with the available evidence. A wet market selling live wildlife in a city with a major coronavirus research lab means both scenarios had a plausible mechanism. The geographic coincidence cuts both ways: the market was a logical place for a natural spillover, and the lab was a logical place for an accidental release. Until China allows a more thorough investigation or a direct ancestor virus is found in nature, the start of COVID-19 will remain one of the defining open questions of the pandemic.

