What Is the Marshmallow Test and Does It Hold Up?

The marshmallow test is a psychology experiment from the early 1970s that measures a young child’s ability to resist a treat now in exchange for a bigger reward later. A researcher places a single marshmallow (or cookie, or pretzel) in front of a preschooler, tells the child they can eat it anytime, but if they wait until the researcher comes back, they’ll get two treats instead. The child is then left alone for up to 15 minutes. What made this simple setup famous wasn’t the experiment itself but the bold claim that followed: that how long a child waited could predict their success decades later. That claim has since been substantially challenged.

How the Original Experiment Worked

The test was developed by psychologist Walter Mischel at Stanford University, conducted with children attending the campus’s Bing Nursery School. Kids were typically three to five years old. The setup was deliberately simple: one treat on a table, an empty room, and a bell the child could ring at any time to bring the researcher back (ending the wait but forfeiting the second treat).

What fascinated researchers wasn’t just whether kids waited, but how they managed to wait. Some children whispered instructions to themselves, repeating the deal out loud: “If I wait, then I get both; but if I don’t, then I just get one.” Others turned away from the treat, covered their eyes, or sang songs. The most effective strategy turned out to be mental reframing. When children were told to imagine the marshmallow was “just a picture” and to “put a frame around it in your head,” they managed to wait an average of almost 18 minutes. In later demonstrations, one child came up with the idea of imagining that cookies were smelly fish, making them unappealing enough to resist.

These observations led Mischel to describe two mental systems at work. A “hot” system responds to the immediate appeal of the reward: the marshmallow is sweet, chewy, and right there. A “cool” system uses abstract thinking to override that impulse, turning the real treat into a mental picture or reframing the situation entirely. The test, in Mischel’s view, wasn’t measuring some fixed trait. It was revealing whether children had stumbled onto cognitive strategies that made waiting easier.

The Famous Claims About Life Outcomes

Starting in the 1980s, Mischel’s team followed up with the original Bing Nursery School participants as they grew into teenagers and adults. The early findings were striking: kids who had waited longer reportedly showed stronger academic and social skills, scored higher on the SAT, and were less prone to frustration as teenagers. Later follow-ups suggested links to lower rates of drug use, better mental health, and healthier body weight through their 20s and 30s.

These results turned the marshmallow test into one of the most famous experiments in psychology. It became a staple of parenting books, TED talks, and corporate training seminars, often simplified into a single lesson: teach your child self-control and they’ll succeed in life.

Why Those Claims Didn’t Hold Up

The original studies had a significant limitation: they drew from a small, homogeneous group of children, mostly from well-off families connected to Stanford University. When researchers ran a much larger version of the experiment with a sample that better reflected the racial and economic diversity of American children, the story changed considerably.

A 2018 replication study found that while children who waited longer did tend to have slightly stronger math and reading skills in adolescence, the association was small. More importantly, it disappeared once researchers accounted for characteristics of the child’s family and early environment, things like household income and the mother’s education level. The authors concluded that programs focused only on teaching young children to delay gratification are unlikely to be effective on their own.

A later study followed the original Bing Nursery School children into their 40s and found even less support for the test’s predictive power. Kids who had grabbed the marshmallow immediately were, by midlife, generally no more or less financially secure, educated, or physically healthy than their patient peers. The researchers reexamined earlier findings on SAT scores, smoking, and body weight and did not see the same strong relationships. Their summary was blunt: “With the marshmallow waiting times, we found no statistically meaningful relationships with any of the outcomes that we studied.” Mischel himself had acknowledged that larger, more diverse samples might produce smaller effects and that home environment likely mattered more than the test itself could capture.

What the Test Actually Measures

One of the most important challenges to the marshmallow test came from research on environmental trust. In a clever variation, researchers first had children do an art project with an experimenter. For some kids, the experimenter was reliable: they promised new crayons and delivered. For others, the experimenter broke their promise. Then both groups did the marshmallow test.

The results were dramatic. Children who had experienced a reliable experimenter waited an average of 12 minutes. Children who had dealt with an unreliable experimenter waited only about 3 minutes. Same age group, same setup, wildly different outcomes. The researchers argued that not waiting for the marshmallow isn’t necessarily a failure of self-control. For a child who has learned that adults don’t keep promises, eating the treat immediately is a perfectly rational decision. Why wait for a reward that might never come?

This reframes what the marshmallow test captures. Rather than measuring some innate willpower, it likely reflects a mix of factors the child brings into the room: their home stability, their experience with reliable adults, their socioeconomic background, and whether they happen to know a useful mental trick like pretending the treat is a picture. A child growing up in an unpredictable environment isn’t showing a character flaw by grabbing the marshmallow. They’re showing good judgment based on their experience.

The “Cool” Strategies That Actually Work

Even though the test doesn’t predict life outcomes the way it was originally claimed to, the research on how children resist temptation remains genuinely useful. Mischel’s team documented specific techniques that reliably extended wait times in children as young as four.

The most powerful strategy was distancing. When kids imagined the treat was a photograph rather than a real object, their wait times jumped dramatically. Physical distancing worked too: children who turned their chair around or covered the treat lasted longer than those who stared at it. Distraction, like singing or playing with their hands, was moderately effective. The least effective approach was what most people assume willpower looks like: sitting still, staring at the treat, and trying to resist through sheer determination.

These findings influenced broader research on self-regulation. The core insight is that resisting temptation isn’t about having a stronger will. It’s about changing how you perceive the temptation in the first place. Reframing a craving, creating mental distance from an impulse, or simply removing the cue from view are all more effective than white-knuckling through it. That principle has held up far better than the marshmallow test’s claims about predicting anyone’s future.

Why It Still Matters

The marshmallow test occupies an unusual place in psychology. Its most famous conclusion, that a preschooler’s patience predicts lifelong success, has been largely debunked. But the experiment itself revealed real things about how self-control works, how environment shapes decision-making, and how easily a simple study can be over-interpreted when it tells a story people want to believe. Mischel himself cautioned that “it’s a terrible mistake to think that if a child can’t wait 15 minutes, the child has serious problems.” The broader lesson isn’t about marshmallows. It’s about how much a child’s circumstances, not just their character, shape the choices they make.