What Does TFIID Do in Gene Transcription?

TFIID is a large protein complex that kicks off the transcription of every protein-coding gene in your cells. Its core job is to read specific DNA sequences at the start of a gene, land on that spot, and then recruit the rest of the molecular machinery needed to copy the gene into RNA. Without TFIID, RNA polymerase II (the enzyme that actually builds the RNA strand) cannot find where to begin.

How TFIID Recognizes a Gene’s Start Site

Every gene has a short stretch of DNA called the core promoter, which acts like a landing pad. TFIID’s job is to find and bind that landing pad. One well-known promoter signal is the TATA box, a short sequence rich in the letters T and A, found roughly 25 to 30 base pairs before the spot where transcription begins. But the majority of human genes lack a clear TATA box. TFIID handles both situations.

At genes without a TATA box, other short DNA motifs serve as anchors. These include the Initiator element (Inr), which sits right at the transcription start site, and two downstream elements called the MTE and DPE, located roughly 18 to 33 base pairs after the start site. Different parts of TFIID grab onto different motifs, so the complex can piece together enough contact points to hold firmly even when no single motif provides a strong signal on its own. In yeast, TFIID often latches onto “TATA-like” sequences that differ from the true TATA box by just one or two letters, supplemented by additional contacts from its other subunits.

What TFIID Is Made Of

Human TFIID contains 14 protein subunits. One is the TATA-binding protein (TBP), which physically contacts the TATA box or its variants and bends the DNA sharply when it binds. The other 13 are called TBP-associated factors, numbered TAF1 through TAF13, with six of them present in two copies within the complex. Several TAFs share structural similarities with histone proteins, the spools around which DNA wraps inside chromosomes. TAF9 and TAF6 resemble histones H3 and H4, while TAF4 and TAF12 resemble H2A and H2B. This histone-like architecture helps TFIID grip DNA in a way that complements how DNA is already packaged in the cell.

Different TAFs handle different tasks. TAF1 and TAF2 together recognize the Initiator element and downstream promoter sequences. TAF4 and TAF12 help stabilize binding to upstream DNA and recruit another factor called TFIIA. TBP itself is held in a repressed state within the complex until the right moment, preventing it from landing on random DNA sequences prematurely.

The Three-Lobed Structure

High-resolution imaging using cryo-electron microscopy has revealed that TFIID is shaped like a three-lobed structure, with lobes labeled A, B, and C. Each lobe has a distinct role in the process of finding and committing to a promoter.

Lobe C, built around TAF1, TAF2, TAF6, TAF7, and TAF8, makes the first contact with downstream promoter DNA. Lobe B, which shares several subunits with Lobe A (TAF4, TAF5, TAF6, TAF9, TAF10, and TAF12) plus TAF8, stabilizes the upstream DNA and helps recruit TFIIA. Lobe A houses TBP along with TAF11 and TAF13, which keep TBP locked down and inactive until TFIID has properly engaged the promoter.

The complex undergoes dramatic shape changes during promoter binding. Lobe A swings roughly 150 ångströms (about 15 nanometers) from its resting position near Lobe C to a new position near Lobe B. This migration moves TBP into range of the TATA box or its equivalent. Researchers have identified at least five distinct structural states: two for the free complex (canonical and extended) and three more that appear when TFIIA and promoter DNA are present (scanning, rearranged, and engaged). These transitions represent TFIID actively searching for and then locking onto the correct start site.

Loading TBP and Building the Machinery

The ultimate goal of all this structural rearrangement is to deposit TBP onto the DNA. In the free complex, TBP is clamped down by TAF11 and TAF13, its DNA-binding surface blocked. As TFIID progresses through its conformational states, those inhibitory contacts are released one by one. When TBP finally engages the promoter and bends the DNA, it physically detaches from Lobe A. This detachment exposes a binding surface for the next factor in the assembly line: TFIIB.

TFIIB is a single protein that bridges TBP to RNA polymerase II. Once TFIIB is in place, it recruits RNA polymerase II (typically already paired with another factor called TFIIF). Then TFIIH joins, bringing the enzymatic activity needed to pry open the DNA double helix and begin synthesizing RNA. TFIIA, which binds alongside TFIIB early in the process, helps stabilize the whole assembly. The resulting multi-protein structure is called the pre-initiation complex, or PIC, and it can contain over 80 individual proteins.

This ordered sequence of events means TFIID is the gatekeeper. Nothing else assembles until TFIID has committed to a promoter.

Beyond Basal Transcription

TFIID does more than just find promoters and load TBP. It also acts as an integration point for signals that turn genes up or down. Gene-specific activator proteins can interact with TAF subunits to boost transcription, while repressors can do the opposite. TFIID also reads chemical marks on histone proteins, the tags that label regions of the genome as active or silent. This allows TFIID to sense the chromatin environment around a gene and factor that information into whether transcription should proceed.

Some TFIID subunits are expressed differently depending on the cell type. TAF4, for example, is highly expressed in neural stem cells in the developing brain, and its levels drop as those cells mature into neurons. During neuronal differentiation, specific alternative forms of TAF4 produced by different splicing patterns appear in particular cell types, suggesting that variations in TFIID composition help activate neuron-specific gene programs. This means TFIID is not a one-size-fits-all machine. Cells can tune its composition to match their transcriptional needs.

Links to Human Disease

Because TFIID is essential for transcribing virtually every protein-coding gene, mutations in its subunits can have severe consequences, particularly in the developing nervous system. Researchers have identified neurodevelopmental disorders linked to mutations in at least five TFIID subunits: TAF1, TAF2, TAF4, TAF6, and TAF13. These conditions share a common feature of intellectual disability, consistent with TFIID’s central role in brain development.

Mutations in TAF1 cause an X-linked intellectual disability syndrome. TAF2 mutations lead to an autosomal recessive form of intellectual disability. TAF6 mutations cause Alazami-Yuan syndrome, and TAF13 mutations cause another recessive intellectual disability. Most recently, researchers identified eight individuals with de novo (newly arising, not inherited) loss-of-function mutations in TAF4, proposing a new condition called TAF4-related neurodevelopmental disorder, or T4NDD. Collectively, these conditions have been termed “TAF-opathies,” reflecting their shared origin in a broken TFIID complex.