What Is a Reinforcer? Definition, Types, and Examples

A reinforcer is any consequence that increases the likelihood of a behavior happening again. If you do something and the outcome makes you more likely to repeat it, that outcome is a reinforcer. This is the core concept behind operant conditioning, the learning process where behavior is shaped by what follows it. A reinforcer is defined entirely by its effect: if a consequence doesn’t actually increase the behavior, it’s not functioning as a reinforcer, no matter how appealing it seems.

How Reinforcers Differ From Punishers

The simplest distinction in behavioral psychology is this: reinforcers increase behavior, punishers decrease it. Both are consequences that follow a behavior, but they push in opposite directions. If your dog sits and gets a treat, and the sitting happens more often afterward, the treat is a reinforcer. If your dog jumps on a guest and gets sprayed with water, and the jumping decreases, the water spray is a punisher.

This gets slightly more nuanced with negative reinforcement, which trips people up. Negative reinforcement still increases behavior, but it does so by removing something unpleasant. Taking an aspirin removes a headache, so you’re more likely to take aspirin next time you have one. The headache going away is a negative reinforcer. A punisher, by contrast, always makes a behavior less likely. The key question is always: did the behavior go up or down?

Primary and Secondary Reinforcers

Primary reinforcers satisfy basic biological needs and require no learning. Food, water, sleep, warmth, and sex all function as reinforcers without any prior experience. A newborn doesn’t need to learn that milk is rewarding.

Secondary reinforcers (also called conditioned reinforcers) only gain their power through association with primary reinforcers. Money is the classic example. Paper currency has no biological value, but because it can be exchanged for food, shelter, clothing, and other necessities, it becomes a powerful reinforcer on its own. Verbal praise works the same way: a parent saying “good job” becomes reinforcing because it’s been repeatedly paired with affection, attention, and tangible rewards.

Token economies build on this principle. In classrooms or treatment settings, people earn tokens, chips, or stars for specific behaviors and later exchange them for items they actually want. The tokens themselves are secondary reinforcers, and they work because of their link to meaningful rewards down the line.

What Happens in the Brain

When you receive a reinforcer, dopamine neurons in the midbrain fire in a specific pattern. These neurons don’t simply respond to rewards. They respond to the gap between what you expected and what you got. An unexpected reward triggers a burst of dopamine activity. A reward you fully predicted produces little or no dopamine response. And when an expected reward fails to arrive, dopamine activity actually dips below baseline.

This system, called reward prediction error, is what drives learning. The dopamine signal travels to the striatum, the rest of the basal ganglia, and the frontal cortex, ultimately reaching the motor systems that control your actions. Neurons in the brain’s decision-making regions code for variables like the value of an object or the value of an action, steering your choices toward whichever option is associated with the strongest reward signal. Animal studies confirm this directly: monkeys and rodents will repeat behaviors that produce dopamine excitation, and many of the brain sites where electrical stimulation is most rewarding overlap with dopamine pathways.

Schedules of Reinforcement

How often and when a reinforcer is delivered matters as much as what the reinforcer is. Reinforcement schedules fall into two broad categories: continuous and partial. Continuous reinforcement means the behavior is reinforced every single time it occurs. This is ideal for teaching a new behavior because it creates a clear, immediate connection between the action and the outcome.

Partial reinforcement means the behavior is only reinforced some of the time, and it comes in four patterns:

Fixed-ratio: Reinforcement comes after a set number of responses. A factory worker paid per 10 units produced is on a fixed-ratio schedule. This generates a high, steady response rate.
Variable-ratio: Reinforcement comes after an unpredictable number of responses. Slot machines operate on this schedule, which also produces high, steady responding and is the most resistant to extinction.
Fixed-interval: Reinforcement comes after a set amount of time. A weekly paycheck is an example. People tend to slow down right after receiving the reinforcer and ramp up as the next delivery approaches.
Variable-interval: Reinforcement comes after unpredictable time periods. Checking your email for a message you’re waiting on follows this pattern. It produces a slow but steady response rate.

When reinforcement stops entirely, the behavior gradually fades, a process called extinction. Behaviors maintained on variable schedules are significantly more resistant to extinction than those on fixed schedules. This is why habits built on unpredictable rewards, like checking social media, can be so persistent.

Activities as Reinforcers: The Premack Principle

A reinforcer doesn’t have to be a thing. It can be an activity. The Premack Principle states that a behavior someone is more likely to do on their own can reinforce a behavior they’re less likely to do. The common parenting version: “You can go to the park after you finish your homework.” Going to the park is the high-probability behavior, homework is the low-probability behavior, and access to one is contingent on completing the other.

This principle was demonstrated experimentally when researchers used a chimpanzee’s preference for eating fruit to reinforce learning the fundamentals of language. The chimp was unlikely to practice language tasks on its own, but pairing those tasks with fruit access made them far more likely. The takeaway is broad: anything a person or animal already prefers doing can serve as a reinforcer for something they’d rather avoid.

What Makes a Reinforcer Effective

Not all reinforcers are equally powerful, and several factors determine how well one works.

Timing is critical. A reinforcer delivered immediately after a behavior creates a stronger association than one delivered after a delay. This is why clicker training works well with animals: the click sound bridges the gap between the behavior and the food reward that follows a few seconds later.

Frequency and size interact in surprising ways. Research on animals choosing between food sources found that a given quantity of food generated more responding when it was delivered frequently in small amounts than when it was delivered infrequently in large amounts. In practical terms, several small rewards spaced across a task can be more motivating than one large reward at the end.

Deprivation and satiation change a reinforcer’s value dramatically. A reinforcer is most powerful when someone has been deprived of it. Food is far more reinforcing when you’re hungry. Conversely, satiation, having unrestricted access to a reinforcer, reduces its effectiveness. Studies show that providing free access to a specific type of food before a session systematically decreases the amount of effort an animal will put in to earn that same food. Interestingly, satiation on one food type primarily reduces motivation for that specific food more than for other types, suggesting that satiation is somewhat targeted rather than producing a general loss of motivation.

How Reinforcers Are Identified in Practice

Because a reinforcer is defined by its function, not its appearance, what works for one person may not work for another. In applied behavior analysis, practitioners use formal preference assessments rather than guessing. These assessments come in several forms: surveys that ask the person (or their caregivers) what they prefer, verbal choice procedures where someone picks between named options, and pictorial choice procedures where someone selects from images of potential reinforcers.

Research comparing these methods found that verbal and pictorial choice assessments accurately identified both high- and low-preference items for most participants. Surveys alone tended to rate many categories as highly preferred, making it harder to distinguish what would actually motivate behavior in practice. The most reliable approach typically involves testing potential reinforcers directly: offering them as consequences and measuring whether the target behavior actually increases. If it does, it’s a reinforcer. If not, regardless of how much someone said they wanted it, it isn’t functioning as one.