How Did COVID Start: Wuhan, Animals, or a Lab?

COVID-19 was first identified in Wuhan, China, in December 2019, but how exactly the virus jumped to humans remains one of the most contested scientific questions of the decade. The earliest laboratory-confirmed case developed symptoms on December 1, 2019, and within weeks a cluster of pneumonia cases was linked to the Huanan Seafood Market. Despite years of investigation by international teams and intelligence agencies, no definitive answer has emerged. Two hypotheses remain on the table: the virus spilled over naturally from animals, or it escaped from a nearby research laboratory.

The First Cases in Wuhan

The initial cluster involved at least 41 people, and 27 of them (about two-thirds) had a connection to the Huanan Seafood Market, a large wholesale market that sold live animals alongside fish and meat. Local health authorities issued an alert on December 31, 2019, and the market was shut down on January 1, 2020.

One detail complicated the picture from the very beginning: the first confirmed patient, who fell ill on December 1, had no history of visiting the market. The next cases didn’t appear until December 10, nine days later. That gap raised early questions about whether the market was truly where the virus first entered the human population, or whether it simply amplified an outbreak that had already begun elsewhere.

Evidence for a Natural Animal Origin

Coronaviruses have jumped from animals to humans before. SARS in 2003 spread from horseshoe bats to civets to people. MERS reached humans through camels. The natural spillover hypothesis for COVID-19 follows a similar logic: the virus originated in bats, possibly passed through an intermediate animal, and eventually infected a person.

The closest known relative of SARS-CoV-2 is a bat coronavirus called RaTG13, identified in 2016, which shares 96% of its genome with the pandemic virus. That 4% gap represents decades of evolutionary distance, meaning RaTG13 itself isn’t the direct ancestor, but it points clearly to horseshoe bats as the broader reservoir. These bats are found primarily in Yunnan Province, more than 1,000 kilometers from Wuhan, which means an intermediate host or human transport of infected animals would likely have been necessary to bridge that distance.

Researchers also found a coronavirus in pangolins with a receptor-binding region (the part that latches onto human cells) nearly identical to SARS-CoV-2’s. But the rest of the pangolin virus genome was too different to make pangolins a plausible direct ancestor. No single animal has been found carrying the immediate precursor to SARS-CoV-2.

What Was Found at the Market

In January 2020, researchers swabbed surfaces throughout the Huanan Seafood Market. One wildlife stall stood out: 30% of samples there tested positive for SARS-CoV-2, including swabs from a cart, a machine used to remove hair and feathers from animals, and the ground. Genetic analysis of those same samples detected DNA from raccoon dogs, civets, and bamboo rats. Researchers also found animal viruses known to infect raccoon dogs and civets, including canine coronaviruses closely related to strains previously sampled from farmed raccoon dogs.

This doesn’t prove that any of these animals were infected with SARS-CoV-2. It does show that susceptible wildlife and the virus were in the same place at the same time, which is exactly the kind of environment where animal-to-human spillover has occurred in past outbreaks.

Evidence for a Laboratory Origin

Wuhan is home to the Wuhan Institute of Virology (WIV), one of China’s leading centers for coronavirus research and the location of the country’s only Biosafety Level 4 laboratory. For years before the pandemic, researchers there collected bat coronaviruses from caves in Yunnan Province and studied them to assess their potential to infect humans.

The lab’s work included genetic manipulation. In a 2015 collaboration with U.S. researchers, WIV scientists took the spike protein gene from a wild bat coronavirus and inserted it into a weakened SARS virus that couldn’t normally enter human cells. The resulting chimeric virus could recognize human cell receptors, replicate efficiently in human airway cells, and cause significant lung disease in mice engineered to carry human receptors. Existing SARS vaccines and antibodies didn’t neutralize it.

In 2018, the WIV and the U.S.-based EcoHealth Alliance submitted a funding proposal to DARPA, a U.S. defense research agency. The proposal described plans to search bat coronavirus samples for viruses with furin cleavage sites, a molecular feature that helps viruses enter human cells. If they didn’t find such viruses naturally, the researchers proposed inserting furin cleavage sites into SARS-like coronaviruses in the lab. DARPA rejected the proposal, but the document showed the WIV had the capability and interest in exactly the kind of work that could, in theory, produce a virus like SARS-CoV-2.

The Furin Cleavage Site Question

One of the most debated molecular features of SARS-CoV-2 is its furin cleavage site, a short stretch of genetic code that allows a common human enzyme called furin to cut the virus’s spike protein and help it enter cells. This feature broadens the range of human tissues the virus can infect and increases its transmissibility. Closely related bat coronaviruses don’t have this feature, which struck some scientists as unusual.

However, other human coronaviruses that cause common colds, as well as MERS, do carry furin cleavage sites. The feature has evolved independently in multiple coronavirus lineages, meaning nature has produced it before without human intervention. Whether it arose naturally in SARS-CoV-2 or was inserted in a lab remains unresolved.

What Investigations Have Concluded

In early 2021, a joint WHO-China team visited Wuhan and evaluated four possible pathways for how the virus reached humans. They rated transmission through an intermediate animal as “likely to very likely,” direct spillover from bats to humans as “possible to likely,” spread through frozen food products as “possible,” and a laboratory incident as “extremely unlikely.” That last assessment drew immediate criticism from scientists and governments who argued the team hadn’t been given enough access to lab records, databases, or personnel to rule it out.

The U.S. intelligence community reached a more divided conclusion. Four agencies and the National Intelligence Council assessed with low confidence that the virus most likely came from natural animal exposure. One agency assessed with moderate confidence that a laboratory incident was the most likely origin, pointing to the inherent risks of the WIV’s coronavirus research. Three agencies couldn’t reach a conclusion at all, with individual analysts split between both hypotheses or viewing them as equally plausible.

Signs the Virus Spread Earlier Than Recognized

Blood samples from U.S. donors suggest the virus may have been circulating before anyone realized it. The CDC tested 7,389 archived blood donations collected by the American Red Cross between December 13, 2019, and January 17, 2020, from donors in nine states. Of those, 106 showed signs of SARS-CoV-2 antibodies, and 84 had neutralizing activity, meaning the immune response was specifically targeting the virus.

The earliest reactive samples came from donations collected between December 13 and 16, 2019, from residents of California, Oregon, and Washington. That’s more than a month before the first officially recognized U.S. case on January 19, 2020. Later reactive donations turned up in Massachusetts, Wisconsin, Iowa, Michigan, Connecticut, and Rhode Island. These findings suggest the virus reached the United States, and possibly other countries, weeks before surveillance systems detected it.

Why the Question Remains Open

The core problem is missing evidence on both sides. No intermediate animal carrying the direct precursor to SARS-CoV-2 has been identified, which would be the strongest proof of natural spillover. But no internal WIV records, lab notebooks, or virus databases have been made available to independent investigators, which would be needed to rule a lab incident in or out. China took the Wuhan Institute of Virology’s virus database offline in September 2019 and has not restored public access. Surveillance systems weren’t designed to catch a novel virus in its earliest days, and key biological samples from the market were collected after it had already been cleaned.

Both hypotheses remain scientifically plausible. The gap between the closest known bat virus and SARS-CoV-2 means that somewhere, whether in the wild or in a lab, the virus underwent significant adaptation before it became capable of spreading efficiently among humans. Until that missing link is found, the origin of COVID-19 will remain an open question shaped as much by the information that was never collected, or never shared, as by the evidence in hand.