Working memory and attention are so deeply intertwined that many researchers consider them inseparable. They rely on overlapping brain regions, develop along the same timeline, and constrain each other’s performance. Your ability to hold information in mind depends on how well you direct attention, and your ability to focus depends on what your working memory can maintain. Understanding this relationship explains a lot about why you lose your train of thought in a noisy room, why some people seem to juggle mental tasks effortlessly, and why conditions like ADHD affect both capacities at once.
What Each System Actually Does
Working memory is the mental workspace where you hold and manipulate small amounts of information for short periods. It’s what you use when you keep a phone number in mind while walking to find a pen, or when you follow a multi-step recipe without re-reading the instructions. It’s not just passive storage. You’re actively doing something with the information: comparing, rearranging, updating.
Attention is the process of selecting what gets priority in your brain. It determines which sights, sounds, or thoughts make it into conscious awareness and which get filtered out. Without attention acting as a gatekeeper, your working memory would be flooded with irrelevant information and collapse under the load.
How Attention Controls Working Memory
One of the most influential models of working memory, developed by Alan Baddeley, includes a component called the central executive. This isn’t a storage bin. It’s an attentional control system that coordinates everything else. Its functions include focusing attention, switching between tasks, updating the contents of memory, inhibiting irrelevant information, and monitoring ongoing goals. In other words, the command center of working memory is itself an attentional mechanism.
A different model, proposed by Nelson Cowan, frames working memory as a hierarchy of activation levels. At the base sits long-term memory. A subset of that becomes temporarily activated (primed and ready for use). And within that activated subset, a smaller portion sits in the focus of attention, which is the part you’re consciously aware of right now. In this view, working memory isn’t a separate box in your brain. It’s just the portion of memory that attention is currently illuminating.
How many items can that spotlight hold? The debate ranges from one to about four. Recent experimental work suggests the focus of attention can maintain at least two separate items simultaneously, and broader evidence points to an immediate memory limit of roughly three to four items when people aren’t using strategies like grouping or rehearsal. Whether the limit is truly fixed or reflects a rapidly rotating spotlight remains an open question, but the core point holds: attention sets the capacity ceiling for working memory.
Filtering: The Gatekeeper Role
One of the clearest demonstrations of how attention shapes working memory comes from research on filtering. Your brain constantly encounters more information than it can store. Attention acts as a bouncer, deciding what gets in and what stays out. People who are better at filtering irrelevant distractors consistently show higher working memory capacity.
A study testing adults, seven-year-olds, and four-year-olds found that filtering efficiency predicted working memory capacity across all three age groups. The correlation was strongest in adults (r = 0.61) and still significant in four-year-olds (r = 0.45). Even after accounting for age and other factors, filtering ability explained an additional 13% of the variation in working memory capacity. This means that a meaningful chunk of why one person can hold more in mind than another comes down to how effectively their attention screens out distractions, not just how much raw storage space they have.
They Share the Same Brain Networks
Neuroimaging research confirms that attention and working memory aren’t just conceptually linked. They activate the same regions. When people perform tasks requiring attentional selection (picking out a target from distractors) and tasks requiring working memory (holding an item’s features in mind), the brain activity patterns in visual and parietal cortex are essentially interchangeable. Frontal cortex contributes as well, though to a lesser degree.
The visual cortex handles both selecting a location and storing the features of a remembered item. Parietal and frontal regions focus more on the selection process itself, directing attention to the right target. Brain activity tracking location peaks at the moment of selection and drops to baseline once location is no longer relevant, while activity tracking stored features builds more slowly and persists through a memory delay. This suggests that attention does the initial grabbing, and memory does the holding, but both operations run on shared neural hardware.
Training studies reinforce this overlap. After people practice working memory tasks, activity increases in the middle frontal gyrus and parietal cortex, regions that are also core players in attentional control. Strengthening one system appears to reshape the other.
How This Plays Out in ADHD
The interplay between attention and working memory is especially visible in ADHD, where both systems are disrupted. Research in biological psychiatry has shown that reduced working memory ability in people with ADHD contributes directly to their difficulties with attentional control. The mechanism involves a region called the dorsolateral prefrontal cortex, which maintains what’s called a “task set,” essentially your brain’s internal reminder of what you’re supposed to be doing right now and what counts as relevant information.
In people with ADHD, lower working memory capacity is associated with reduced activity in this region during tasks that require ignoring distractions. The result is a feedback loop: weaker working memory means you can’t maintain the mental template of what’s important, which means irrelevant information breaks through, which further disrupts memory. Some evidence suggests that individuals with ADHD compensate by relying more heavily on reactive, stimulus-driven attention (responding to things as they pop up) rather than proactive, sustained attention (keeping a goal steadily in mind). This is why ADHD often looks less like an inability to pay attention and more like attention that’s pulled by whatever is most immediately salient.
How the Relationship Develops With Age
Working memory capacity grows in a non-linear pattern from early childhood through adolescence. The most rapid gains happen between ages 3 and 10, with a second, smaller burst during early adolescence (roughly ages 13 to 17). Between those growth spurts, late childhood and late adolescence are relatively stable periods. This trajectory closely mirrors the development of attentional control, which makes sense given how tightly the two systems are coupled.
The filtering data from younger children reinforces this connection. Four-year-olds show the weakest filtering efficiency and the lowest working memory capacity. By age seven, both improve substantially. The fact that the correlation between filtering and capacity is already significant in preschoolers suggests that attention isn’t just layered on top of memory at some later developmental stage. The two grow together from the start.
What This Means in Everyday Life
When you try to remember a grocery list while someone is talking to you, both systems are taxed simultaneously. Your attention has to suppress the conversation (filtering), maintain focus on the list items (selection), and your working memory has to hold and refresh those items. If either system falters, the list slips away.
Cognitive tasks that heavily tax this interaction become measurably harder as demands increase. In a common lab task called the N-back, people monitor a stream of items and respond whenever the current item matches one from a few steps earlier. At higher loads (like matching three items back instead of two), participants make more errors specifically because familiar-seeming distractors trick them into false responses. The task challenges your ability to control automatic reactions based on familiarity, which is a pure test of attention regulating what stays in and what gets rejected from working memory.
This is also why multitasking is so draining. Every time you switch tasks, your central executive has to disengage attention from one set of information, suppress it, engage a new set, and update working memory with different goals and content. Each switch carries a cost in both speed and accuracy because you’re reshuffling the same limited attentional resources that working memory depends on to function.

