The clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated protein (Cas) technology is a gene-editing tool derived from the adaptive immune system of bacteria. This natural defense mechanism allows bacteria to recognize and destroy the genetic material of invading viruses. Scientists primarily use components from Streptococcus pyogenes to manipulate DNA precisely in nearly any organism. Targeted changes rely on the precise interaction of a few specific molecular components.
The Cas Enzyme
The Cas enzyme is the protein component of the system and acts as the molecular scissor responsible for cutting the target DNA. The most widely studied version is Cas9, an endonuclease that cleaves double-stranded DNA. Variants like Cas12 and Cas13 also exist, each with unique properties and cleavage mechanisms. Cas9 requires the presence of a guide RNA molecule to become active.
Cas9 contains two distinct nuclease domains, RuvC and HNH, which are responsible for creating the break in the DNA structure. The RuvC domain cleaves one strand of the DNA double helix, while the HNH domain cleaves the complementary strand. This simultaneous action results in a clean double-strand break, typically three base pairs upstream of a specific recognition sequence. This precise, targeted break initiates all subsequent gene editing.
The Guide RNA
The guide RNA (gRNA) directs the Cas enzyme to the exact location in the genome where the edit is to occur. For laboratory applications, a synthetic single guide RNA (sgRNA) is often used, which combines the functions of the two RNA molecules found in the native bacterial system: the CRISPR RNA (crRNA) and the trans-activating crRNA (tracrRNA).
The crRNA component contains a variable spacer sequence, usually around 20 nucleotides long, that is complementary to the target DNA sequence. This sequence provides the system with its remarkable specificity, as the Cas enzyme will only bind tightly to a region that perfectly matches this sequence. The tracrRNA serves as a scaffold that binds directly to the Cas protein. It helps maintain the Cas enzyme in an active conformation and forms a duplex with the crRNA, creating the functional guide molecule.
The Protospacer Adjacent Motif (PAM)
The Protospacer Adjacent Motif (PAM) is a short, specific DNA sequence that is absolutely required for the Cas enzyme to bind and initiate DNA unwinding. This sequence is located immediately adjacent to the target site on the foreign DNA, but it is not part of the target DNA itself. For the widely used Cas9 from S. pyogenes, the canonical PAM sequence is 5′-NGG-3′, where ‘N’ can be any nucleobase followed by two guanines.
The PAM sequence functions as the initial molecular signal that allows the Cas-gRNA complex to recognize potential targets. Without the presence of this sequence, the Cas enzyme cannot stably bind or cleave the DNA, even if the guide RNA perfectly matches the target sequence. This requirement is a mechanism that prevents the bacterial immune system from attacking its own DNA. The presence and type of PAM sequence constrain which genomic locations can be targeted for editing, leading to research into Cas variants that recognize different PAMs.
How the Components Work Together
The process of gene editing begins with the assembly of the ribonucleoprotein complex (RNP), which consists of the Cas enzyme bound to the single guide RNA. This RNP complex scans the genome, searching for potential targets. The initial recognition step involves the Cas protein checking the DNA for the presence of the PAM sequence.
Once the PAM is located, the Cas enzyme initiates localized unwinding of the DNA double helix adjacent to the motif. This unwinding allows the guide RNA’s spacer sequence to test for base pairing complementarity with the target DNA strand. If the gRNA successfully pairs with the target DNA, a stable R-loop structure forms, confirming the correct target site. This stable binding triggers a conformational change in the Cas enzyme, activating its nuclease domains.
The activated Cas enzyme then uses its RuvC and HNH domains to cleave both strands of the DNA, creating the double-strand break. This break activates the cell’s natural DNA repair machinery, which primarily uses one of two pathways.
DNA Repair Pathways
Non-homologous end joining (NHEJ) is often error-prone, resulting in small insertions or deletions that disrupt gene function.
Homology-directed repair (HDR) is more precise but less frequent, incorporating a provided template to insert, delete, or replace a specific sequence.

