What Is the First Step in Memory Formation?

The first step in memory formation is encoding, the process by which your brain converts sensory experiences into a neural trace that can be stored and later retrieved. Before any memory can be saved or recalled, the information has to be written into the brain’s circuitry. This happens in seconds to minutes and involves a rapid chain of events, from raw sensory input to chemical changes at the connections between brain cells.

Sensory Memory: Where It All Starts

Every memory begins with your senses. When you see, hear, touch, smell, or taste something, that raw information enters what’s called sensory memory, a very brief holding area that lasts only about 0.2 to 2 seconds. Visual impressions typically fade within 1 second, while sounds can linger slightly longer. Sensory memory has a large capacity, meaning your brain takes in far more information than you’re consciously aware of, but almost all of it disappears almost immediately.

The critical transition happens around a quarter of a second after the stimulus arrives. At roughly that point, information begins transferring from sensory memory into short-term (working) memory. But here’s the catch: only a tiny fraction of what enters sensory memory makes that jump. The gatekeeper is attention.

How Attention Filters What Gets Encoded

Your brain doesn’t encode everything it senses. Selective attention acts as a filter, determining which pieces of incoming information are strong enough to move forward in the memory process. When you focus on something, the brain regions responsible for processing that specific type of information become more active, and that heightened activity makes the experience more likely to be encoded.

This filtering happens through a partnership between the outer layers of the brain (cortical regions that process what you see, hear, and feel) and the hippocampus, a deeper structure essential for forming new memories. The hippocampus responds selectively to whatever you’re paying attention to. If you’re focused on the color of an object, the hippocampus encodes color. If you’re focused on location, it encodes location. Features you aren’t attending to simply don’t get written into the memory trace. This is why you can walk through a crowded room and remember the face of someone you were looking at but have no memory of the music playing in the background.

What Happens in the Brain During Encoding

At the cellular level, encoding relies on strengthening the connections between neurons, a process called long-term potentiation. When a particular experience activates a group of neurons, the first thing that happens is a burst of chemical signaling at the junctions (synapses) between them. The sending neuron releases a chemical messenger called glutamate, which triggers receptors on the receiving neuron.

One type of receptor acts like a molecular “and” gate. It only opens when two conditions are met simultaneously: the chemical messenger is present, and the receiving neuron is already electrically active. When both conditions align, calcium floods into the receiving neuron. That calcium surge is the critical trigger for encoding. It kicks off a cascade of chemical reactions that make the connection between those two neurons stronger and more responsive, so the same signal will pass more easily next time. This initial strengthening phase lasts from seconds to a few hours and relies on modifying proteins that are already present in the cell, without needing to build new ones.

The physical structure of the connection point itself also changes during this phase. The tiny protrusion on the receiving neuron (called a spine) enlarges, giving the two neurons a bigger surface area for communication. All of this happens within seconds to minutes of the original experience.

The Brain Regions That Work Together

Encoding isn’t handled by a single brain area. The hippocampus captures moment-to-moment details of an experience, stamping them with information about where and when they occurred. Meanwhile, the prefrontal cortex plays a different role: it sorts incoming information by similarity, compares it to what arrived a moment ago, and extracts the central meaning of the experience. It then sends signals back to the hippocampus that highlight which details are most relevant, essentially telling the hippocampus what to prioritize.

Emotional experiences get an additional boost. When something feels threatening or rewarding, a brain structure called the amygdala sends modulatory signals that strengthen the encoding process. This is why emotionally charged events tend to form stronger, more vivid memories than neutral ones.

Three Ways Your Brain Encodes Information

Not all encoding is equally effective. Your brain can encode information in at least three different ways, and the method matters for how well you’ll remember it later.

  • Acoustic encoding processes the sounds of words and language. When you repeat a phone number out loud to yourself, you’re encoding it acoustically.
  • Visual encoding processes images and spatial information. Picturing a map or a person’s face uses this pathway.
  • Semantic encoding processes meaning. When you understand what something means and connect it to things you already know, you’re encoding it semantically.

Of these three, semantic encoding produces the strongest long-term memories. Words and concepts that are encoded by their meaning are remembered significantly better than those encoded only by how they sound or look. This is because understanding meaning involves deeper processing, engaging more neural networks and creating more connections to existing knowledge. High-imagery words get a double advantage: they’re encoded both visually and semantically, building an even stronger memory trace.

Why Encoding Sometimes Fails

When you forget where you put your keys or can’t recall someone’s name moments after being introduced, the problem usually isn’t retrieval. The memory was never properly encoded in the first place. This is called encoding failure, and it’s the most common reason for everyday forgetting.

Distraction is the primary culprit. Because attention is the gatekeeper for encoding, anything that divides your attention during an experience reduces the quality of the resulting memory. Multitasking while studying, scrolling your phone during a conversation, or simply being lost in thought when someone tells you their name all prevent full encoding.

Stress and negative emotions can also disrupt encoding in specific ways. When people are exposed to emotionally negative stimuli, the surrounding context (where they were, what else was happening) tends to be encoded poorly. This appears to happen because the emotional content hijacks attentional resources, weakening the connection between the amygdala and prefrontal cortex that normally supports contextual encoding. Higher levels of cortisol, the body’s primary stress hormone, are also associated with poorer recall of contextual details a day later.

How to Encode More Effectively

Since encoding is the foundation of all memory, improving this first step pays dividends for everything that follows. The single most impactful strategy is elaboration: connecting new information to things you already know, building it into sentences, or creating mental images. In studies comparing different learning strategies, participants who wove new words into meaningful sentences remembered significantly more on delayed memory tests than those who simply had extra time to study. The benefit of elaboration was specifically tied to long-term retention, not just short-term recall.

Combining strategies amplifies the effect. Forming a mental image while also connecting information to its meaning engages both visual and semantic encoding pathways, creating multiple routes back to the same memory. Chunking, or grouping individual pieces of information into meaningful clusters, reduces the load on working memory and gives you more room to elaborate on each chunk.

The flip side is equally important: minimize divided attention during moments you want to remember. Putting your phone away during a lecture, making eye contact when someone introduces themselves, or pausing to actually look at where you’re setting down your keys gives your brain’s encoding machinery the focused input it needs to write a lasting trace.