Reinforcement is any consequence that increases the likelihood of a behavior happening again. It’s one of the central ideas in psychology, first formalized by B.F. Skinner in the 1930s as part of his work on operant conditioning. The concept is simple at its core: behaviors that produce favorable outcomes get repeated, while behaviors that don’t tend to fade. But the details of how reinforcement works, the different forms it takes, and why it sometimes backfires reveal a surprisingly rich picture of how humans and animals learn.
How Reinforcement Fits Into Operant Conditioning
Skinner coined the term “operant conditioning” in 1937 to describe behavior that is shaped by its consequences. Unlike reflexes (flinching at a loud noise, for example), operant behaviors are things an organism does that affect the environment. A rat presses a lever and gets food. A child says “please” and gets a cookie. In both cases, the consequence of the action makes it more likely to happen again. That consequence is the reinforcer.
In practice, operant conditioning is what most people would simply call habit formation. A reinforcement schedule is any procedure that delivers a reinforcer according to some defined rule, whether that’s every single time the behavior occurs or only occasionally. The schedule matters enormously, as it determines how quickly a behavior is learned, how consistently it’s performed, and how long it persists when the reinforcer disappears.
Positive vs. Negative Reinforcement
This is where confusion tends to start. “Positive” and “negative” here don’t mean good and bad. They mean adding and removing. Positive reinforcement strengthens a behavior by adding something desirable afterward. Negative reinforcement strengthens a behavior by removing something unpleasant.
A classic positive reinforcement example: in one well-known experiment, rats in a cold chamber learned to press a lever that turned on a heat lamp. The addition of warmth reinforced the lever-pressing behavior. In everyday life, getting a compliment after helping someone, earning a paycheck for working, or hearing a satisfying “ding” when you complete a task on an app all work the same way.
Negative reinforcement looks different. A person takes aspirin and their headache goes away. The removal of pain reinforces the behavior of taking aspirin next time a headache strikes. Buckling your seatbelt to stop the car’s warning chime is another example. Researchers note that the core phenomena behind negative reinforcement are escape and avoidance behavior: you learn to do things that either end an unpleasant situation or prevent one from starting.
Interestingly, psychologists have pointed out that the line between these two categories is blurry. Producing a stimulus (positive reinforcement) always involves escaping a situation where that stimulus was absent (negative reinforcement), and vice versa. The rat getting warmth is also escaping cold. In practice, though, the distinction is useful for analyzing why specific behaviors persist.
Reinforcement Is Not the Same as Punishment
People frequently mix up negative reinforcement with punishment. The key difference is the direction of the effect. Reinforcement, whether positive or negative, always increases the likelihood of a behavior. Punishment always decreases it. Negative reinforcement removes something unpleasant to encourage a behavior. Punishment either adds something unpleasant or removes something desirable to discourage a behavior.
A child who cleans their room to stop a parent’s nagging is experiencing negative reinforcement (the nagging stops, so room-cleaning increases). A child who loses screen time for not cleaning their room is experiencing punishment (something desirable is removed, so the failure to clean decreases). Same situation, opposite mechanisms.
Primary and Secondary Reinforcers
Not all reinforcers are created equal. Primary reinforcers satisfy innate biological needs and don’t have to be learned. Food, water, sleep, shelter, sex, and physical touch all fall into this category. They work because your brain is wired to seek them out for survival.
Secondary reinforcers, by contrast, have no built-in value. They only become reinforcing because they’ve been linked to primary reinforcers through experience. Money is the most obvious example. A dollar bill is just paper, but because you can exchange it for food, shelter, and other things you need, it becomes powerfully reinforcing. Praise works similarly: it’s linked to social bonding and affection. Stickers on a child’s behavior chart, tokens in a reward system, and grades in school are all secondary reinforcers. Their power depends entirely on the connection to something the person already values.
How Schedules Shape Behavior
The pattern in which reinforcement is delivered has a dramatic effect on behavior. Psychologists have identified four basic schedules, and each produces a distinct pattern of responding.
- Fixed ratio: Reinforcement comes after a set number of responses (for example, every 10th lever press or every 5th sale). This produces a high response rate, but people and animals tend to pause briefly right after receiving the reinforcer before ramping back up.
- Variable ratio: Reinforcement comes after an unpredictable number of responses. Slot machines work on this schedule, and it’s the most powerful of the four. It produces high, steady response rates with little to no pausing, and behaviors learned this way are the most resistant to extinction.
- Fixed interval: Reinforcement becomes available after a set amount of time (like a paycheck every two weeks). This produces the lowest response rate of the four schedules. People tend to slack off right after reinforcement and increase effort as the next one approaches, creating what psychologists call a “scalloped” response pattern.
- Variable interval: Reinforcement becomes available after unpredictable time periods (like checking your phone for a text that could arrive at any moment). This produces a moderate, steady response rate because there’s no way to predict when the next reinforcer will come.
The practical takeaway: if you want a behavior to be performed consistently and to persist over time, variable schedules outperform fixed ones. Variable ratio schedules, in particular, are both the most productive and the hardest to extinguish.
What Happens When Reinforcement Stops
When reinforcement for a behavior is completely withdrawn, the behavior eventually fades. This process is called extinction. But it rarely happens smoothly. When extinction first begins, there’s typically an extinction burst: a temporary spike in the frequency or intensity of the behavior. A child who has always gotten attention by whining will, when the attention stops, whine louder and more often before eventually giving up. This burst is actually a sign that extinction is working, not that it’s failing.
How quickly a behavior extinguishes depends heavily on how it was reinforced in the first place. Behaviors maintained on variable schedules, especially thin ones (where reinforcement comes only occasionally), are far more resistant to extinction. This is why gambling habits are so persistent. The variable ratio schedule of occasional wins creates behavior that can survive long stretches without any payoff at all.
The Brain’s Reinforcement System
At a biological level, reinforcement is driven largely by dopamine, a chemical messenger in the brain. Dopamine-producing neurons sit in midbrain structures and send signals to areas involved in motivation, learning, and movement. When something rewarding happens, these neurons fire and release dopamine into a network that includes regions responsible for processing pleasure, forming memories, and planning future actions.
This system doesn’t just respond to rewards after the fact. Over time, it learns to anticipate them. The dopamine signal shifts from firing when you receive a reward to firing when you encounter a cue that predicts one. This is why the notification sound on your phone can feel almost as compelling as the message itself. Your brain has learned that the sound predicts something potentially rewarding, and the dopamine system activates in advance.
When External Rewards Backfire
One of the more counterintuitive findings in reinforcement research is the “undermining effect.” When people are already intrinsically motivated to do something (they enjoy it for its own sake), adding an external reinforcer like money or prizes can actually decrease their motivation. Self-determination theory explains this by suggesting that the external reward shifts a person’s sense of control from internal to external. The activity starts to feel like something you do for the reward rather than something you do because you want to.
This has real implications for parenting, education, and workplaces. Paying a child for reading books they already love, or giving bonuses for creative work that employees find inherently satisfying, can sometimes reduce the very behavior you’re trying to encourage. The reinforcer changes the person’s relationship to the activity. This doesn’t mean external reinforcers are always harmful. They’re highly effective for tasks people find boring or difficult. The undermining effect is most pronounced when the activity was already enjoyable and the reward feels controlling.
Reinforcement in Therapy and Daily Life
Reinforcement principles are the foundation of Applied Behavior Analysis (ABA), a therapeutic approach widely used with developmental disabilities. In ABA, positive reinforcement is the primary tool: desired behaviors are followed by rewarding outcomes to build skills and social interactions. Negative reinforcement is also used, where completing a task removes an unpleasant demand. Extinction is applied to reduce challenging behaviors by ensuring they no longer produce any reinforcing consequence.
Outside of clinical settings, reinforcement shows up in behavior contracts, which spell out expected behaviors and the rewards or consequences tied to them. These are used in schools, workplaces, and family settings. The contract makes the reinforcement contingency explicit: if you do X, Y happens.
Even without formal systems, reinforcement is constantly operating in your daily life. The apps on your phone are designed around variable ratio schedules. Your social interactions are shaped by the reinforcement of approval and connection. Your habits, from morning coffee to exercise routines, persist because of the reinforcing consequences that follow them. Understanding reinforcement doesn’t just explain laboratory behavior. It explains a large portion of why you do what you do.

