Where Do Enhancers Bind and What Proteins Attach?

Enhancers bind to non-coding regions of DNA, often thousands of base pairs away from the genes they control. They can sit upstream, downstream, or even within the introns of their target genes. What makes enhancers unusual is that their position in the linear DNA sequence doesn’t need to be close to a gene’s promoter for them to work. Some enhancers operate from more than a megabase (one million base pairs) away, yet still activate transcription by physically looping through three-dimensional space to contact their target promoter.

Location in the Genome

Enhancers don’t have a single fixed address. They’re scattered throughout the genome in intergenic regions (the stretches of DNA between genes) and within the introns of genes themselves. Some enhancers regulate the very gene they sit inside, while others skip over neighboring genes entirely to activate a distant target. This flexibility is part of what makes enhancers so powerful and so difficult to study.

Researchers categorize enhancer-to-promoter distances into rough groups: short range (2 to 10 kilobases), mid-range (10 to 40 kilobases), and long range (50 to 500 kilobases). Some interactions stretch even further, spanning over a megabase. As the distance increases, the genes controlled by those enhancers tend to be more tissue-specific, meaning they’re active in only one or a few cell types rather than broadly across the body.

How Distant Enhancers Reach Their Targets

The key to understanding enhancer binding is that DNA inside a cell isn’t stretched out like a straight line. It’s folded, coiled, and organized into loops. This three-dimensional architecture allows an enhancer that looks far away on a map of the genome to physically touch the promoter of its target gene. Think of it like pinching two distant points on a ribbon together: the intervening DNA loops out, and the enhancer and promoter meet.

Two proteins play major roles in creating and maintaining these loops. Cohesin acts like a ring that threads DNA through itself, actively extruding loops of chromatin. CTCF acts as a stop sign, halting cohesin at specific positions and helping define the boundaries of loop domains called topologically associating domains, or TADs. These TADs function as neighborhoods, keeping enhancers and their target promoters in the same general vicinity and preventing enhancers from accidentally activating the wrong genes.

Interestingly, when researchers removed CTCF or cohesin from cells, most enhancer-promoter connections and gene expression levels held steady in the short term. This suggests that while these structural proteins help establish and fine-tune enhancer-promoter contacts, other mechanisms also hold these interactions together. Cohesin may be especially important for helping transcription factors efficiently search for and find their binding sites across long distances.

What Binds at the Enhancer Itself

An enhancer is a stretch of DNA, typically a few hundred base pairs long, packed with short sequence motifs that serve as landing pads for transcription factors. These proteins recognize and bind specific DNA sequences within the enhancer, and it’s their collective activity that gives the enhancer its power. No single transcription factor typically does the job alone. Instead, multiple factors bind cooperatively, meaning the binding of one helps recruit or stabilize others nearby.

The specific transcription factors vary by cell type and enhancer, but certain families appear repeatedly. Studies in fruit fly cells found that factors like Trithorax-like (Trl), Twist, and members of the GATA family were among the most common occupants of active enhancers. Some factors tend to bind at the central “initiating” site of an enhancer, while others fill in at secondary positions. This cooperative binding between distant sites within the same enhancer is a hallmark of enhancers that are actively driving gene expression.

Once transcription factors land on the enhancer, they recruit co-activator proteins. The Mediator complex is one of the most important: it acts as a physical bridge between the enhancer-bound transcription factors and the RNA polymerase II machinery sitting at the promoter. Enzymes like p300 and CBP are also recruited to enhancers, where they add chemical tags (acetyl groups) to nearby histone proteins, further opening up the chromatin and reinforcing the enhancer’s active state.

How Cells Choose Which Enhancers to Use

Your genome contains hundreds of thousands of potential enhancer sequences, but only a fraction are active in any given cell. A liver cell uses a completely different set of enhancers than a neuron, even though both carry the same DNA. The difference comes down to which transcription factors are present in each cell type and what chromatin landscape they encounter.

Pioneer transcription factors are the first movers. They can bind to DNA even when it’s tightly wrapped around histones and relatively inaccessible. By binding, they begin to open the chromatin, making it possible for other transcription factors to follow. This process effectively “primes” certain enhancers for activation in a specific cell lineage, sometimes well before the genes they control are actually turned on.

Scientists can read the activation status of an enhancer by looking at chemical marks on the histone proteins wrapped around it. A histone modification called H3K4me1 marks enhancer regions regardless of whether they’re active or inactive. When a second mark, H3K27ac, is also present, the enhancer is actively driving transcription. Enhancers carrying H3K4me1 but lacking H3K27ac are considered “poised,” meaning they’re ready to activate but haven’t yet received the right signal. Some poised enhancers also carry repressive marks like H3K27me3, keeping them firmly silenced until the cell receives a developmental or environmental cue.

Super-Enhancers: Clusters With Outsized Influence

Not all enhancers are created equal. Super-enhancers are clusters of individual enhancers packed closely together in the genome, and they drive exceptionally high levels of gene expression. In mouse embryonic stem cells, researchers identified about 8,800 regular enhancers averaging around 700 base pairs in size, but only 231 super-enhancers, which averaged 8.7 kilobases each. Super-enhancers are loaded with unusually high concentrations of the Mediator complex, transcription factors, and the active histone mark H3K27ac. They tend to control genes that define a cell’s identity, making them critical players in development and, when disrupted, in disease.

When Enhancer Binding Goes Wrong

Because enhancers control when and where genes are expressed, mutations in enhancer sequences can cause disease even though no protein-coding gene is directly damaged. Several congenital conditions trace back to a single mutation in an enhancer. A variant in an enhancer for the RET gene is a common cause of Hirschsprung disease, a condition where nerves are missing from parts of the intestine. Mutations in an enhancer for PTF1A cause isolated pancreatic agenesis, where the pancreas fails to develop. And a variant in a regulatory sequence controlling the Sonic Hedgehog gene leads to triphalangeal thumb, where the thumb develops with three bones instead of two.

These examples illustrate that enhancers aren’t just fine-tuning knobs. They’re essential switches, and losing or altering an enhancer binding site can have consequences as severe as losing the gene itself.

How Scientists Map Enhancer Locations

Identifying where enhancers bind across the genome requires specialized tools. ChIP-seq (chromatin immunoprecipitation followed by sequencing) is one of the most widely used methods. It works by using antibodies to pull down proteins or histone marks associated with enhancers, then sequencing the DNA fragments that come along for the ride. By targeting marks like H3K4me1 and H3K27ac, researchers can build genome-wide maps of active and poised enhancers.

A complementary technique called STARR-seq takes a more functional approach. It tests millions of DNA fragments simultaneously to see which ones can actually boost gene expression. Fragments of the genome are cloned downstream of a minimal promoter, and any fragment with enhancer activity will transcribe itself, producing RNA that can be sequenced and quantified. This gives researchers a direct, quantitative readout of enhancer strength rather than just a prediction based on histone marks. Together, these methods have revealed that enhancers are far more numerous than genes, numbering in the hundreds of thousands across the human genome.