A reinforcer in ABA (applied behavior analysis) is any consequence that follows a behavior and makes that behavior more likely to happen again in the future. It’s not just a treat or a sticker. If a consequence doesn’t actually increase the behavior it follows, it’s not functioning as a reinforcer, no matter how appealing it seems. This distinction is central to how ABA works: reinforcement is defined by its effect on behavior, not by how “nice” the item or event appears.
Reinforcers vs. Rewards
People often use “reward” and “reinforcer” interchangeably, but they mean different things in behavioral science. A reward is something generally pleasant that you give someone. A reinforcer is specifically a consequence that strengthens a behavior. The difference matters because something can feel like a reward without actually changing behavior. If you praise a child every time they raise their hand, but they never raise their hand more often, praise isn’t functioning as a reinforcer for that child in that context.
This is why ABA practitioners test whether something works rather than assuming it will. A toy might be highly motivating for one child and completely uninteresting to another. The label “reinforcer” is earned through observable results.
Positive and Negative Reinforcement
These terms trip people up because “positive” and “negative” don’t mean “good” and “bad.” They refer to whether something is added or removed.
- Positive reinforcement means a stimulus is added after a behavior, and the behavior increases. A teacher gives a high-five after a student completes a task, and the student starts completing tasks more consistently.
- Negative reinforcement means a stimulus is removed after a behavior, and the behavior increases. A child puts on sunglasses to reduce glare, and continues wearing sunglasses on sunny days because the discomfort goes away.
Both types strengthen behavior. Negative reinforcement is not punishment. Punishment decreases behavior; reinforcement always increases it.
Primary and Conditioned Reinforcers
Primary reinforcers (also called unconditioned reinforcers) are things that are inherently reinforcing because they meet biological needs. Food, water, warmth, and relief from pain all fall into this category. No one has to learn that food is satisfying when they’re hungry.
Conditioned reinforcers (also called secondary reinforcers) gain their power through association with other reinforcers. Money is the classic example: a dollar bill has no biological value, but it’s been paired so many times with things that do (food, shelter, comfort) that it functions as a powerful reinforcer on its own. In ABA settings, common conditioned reinforcers include tokens or points that can be exchanged for preferred items, verbal praise, access to a favorite toy, screen time, or music. These work because the individual has learned to associate them with something they value.
The goal in most ABA programs is to move away from primary reinforcers like food toward conditioned reinforcers like praise and social interaction, since those are what naturally maintain behavior outside of therapy.
Common Categories of Reinforcers
ABA practitioners generally organize reinforcers into several functional categories:
- Edible: food or drink items (chips, juice, small candy)
- Tangible: physical objects like toys, stickers, or books
- Activity: access to preferred activities like playing outside, drawing, or watching a video
- Social: attention, praise, hugs, or interaction with a preferred person
- Sensory: experiences that provide specific sensory input, such as listening to music, playing with textured materials, or spinning in a chair
What falls into each category varies enormously between individuals. A sensory experience that one child finds reinforcing might be aversive to another. This is why individual assessment is so important in ABA.
How Practitioners Identify Reinforcers
ABA therapists don’t guess what a person finds reinforcing. They use structured preference assessments to figure it out. Several standard methods exist, and each has trade-offs in terms of accuracy and time.
In a paired-stimulus assessment, the practitioner presents two items at a time and asks the individual to pick one. After 20 to 25 trials covering different combinations, a clear hierarchy of preferences emerges. This method is more accurate than simply offering one item at a time, but it takes longer to complete.
The multiple stimulus without replacement (MSWO) approach presents five to seven items at once. The individual picks one, engages with it briefly, and then the remaining items are re-presented without the chosen one. This repeats until all items have been selected or the individual stops choosing. MSWO generates a ranked preference list more quickly than paired-stimulus assessments, though it requires the individual to scan all the options and attend to the array.
Free-operant assessments simply give the individual open access to a variety of items and track what they naturally gravitate toward and how long they engage with each one. Parent and caregiver interviews also help identify potential reinforcers before formal testing begins.
These assessments need to be repeated regularly, because preferences shift. What was highly motivating last month may hold no interest today.
Why Timing Matters
For a reinforcer to work, it needs to be delivered close in time to the behavior it’s meant to strengthen. Research on this is clear: when reinforcement is immediate or delayed by only about one second, animals and humans learn new behaviors quickly, sometimes within a single session. When the delay stretches to 10 seconds, learning can stall dramatically. In one study, a pigeon failed to learn a simple response across more than ten hours of training when reinforcement was delayed by just 10 seconds.
In practical ABA settings, this means therapists aim to deliver reinforcement within one to two seconds of the target behavior. Delays of around 30 seconds make it difficult for the learner to connect the behavior with the consequence. This is also why token systems are useful: a token can be delivered instantly even when the actual backup reinforcer (like a preferred activity) can’t be provided until later.
Motivating Operations Change a Reinforcer’s Value
A reinforcer’s effectiveness isn’t fixed. It fluctuates based on what behavior analysts call motivating operations. These are conditions that temporarily change how much someone values a particular consequence.
The simplest example is hunger. Food is a powerful reinforcer when you haven’t eaten in hours. Right after a large meal, that same food holds little appeal. In ABA terms, food deprivation is an establishing operation because it increases the value of food as a reinforcer and makes food-seeking behavior more likely. Being full is an abolishing operation because it decreases food’s reinforcing value.
This concept applies to all reinforcer types. A child who has been playing alone all morning may find adult attention highly reinforcing. The same child, after an hour of one-on-one interaction, may not work for attention at all. Skilled practitioners pay attention to these shifting conditions and adjust which reinforcers they use throughout a session.
Reinforcement Schedules Shape Response Patterns
How often reinforcement is delivered matters as much as what the reinforcer is. ABA uses four basic reinforcement schedules, each producing a distinct behavioral pattern.
A fixed-ratio schedule delivers reinforcement after a set number of responses. If a child earns a token for every three math problems completed, that’s a fixed-ratio 3 schedule. This tends to produce fast, steady responding with brief pauses right after reinforcement is delivered.
A variable-ratio schedule delivers reinforcement after an unpredictable number of responses, averaging around a set number. Slot machines operate on this schedule, which is why people keep pulling the lever. Variable-ratio schedules produce the highest, most consistent response rates and are the most resistant to extinction (meaning the behavior persists longest even after reinforcement stops).
A fixed-interval schedule reinforces the first response after a set amount of time has passed. This creates a distinctive “scalloped” pattern where responding is slow right after reinforcement, then accelerates as the next reinforcement window approaches. Studying for a weekly quiz often follows this pattern.
A variable-interval schedule reinforces the first response after an unpredictable amount of time. Checking your email is a good example: messages arrive at irregular intervals, so you check at a steady, moderate rate. This schedule produces consistent but not especially fast responding.
In ABA therapy, new skills are typically taught using continuous reinforcement (reinforcing every correct response) and then gradually shifted to intermittent schedules. This transition is important because real life rarely reinforces every instance of a behavior, and intermittent schedules help the behavior persist in natural settings.
Ethical Use of Reinforcement in ABA
Professional guidelines from the Behavior Analyst Certification Board require practitioners to prioritize reinforcement-based procedures over punishment-based ones. When designing interventions, behavior analysts are expected to select the least restrictive approach that is still effective. This means choosing praise or activity-based reinforcers over food when possible, and using the mildest effective intervention before considering anything more intrusive.
The use of edible reinforcers, while sometimes necessary, comes with additional considerations. Practitioners are expected to assess whether food-based reinforcement is genuinely the least restrictive option and to work toward fading it in favor of more naturally occurring reinforcers like social praise and activity access. The broader ethical framework emphasizes that reinforcement strategies should be individualized, regularly reassessed, and designed to promote the learner’s long-term independence.

