What Are the Principles of Behavior in ABA?

The principles of behavior are a set of fundamental rules that explain why people (and animals) do what they do. At their core, these principles describe a simple relationship: behavior is shaped by what happens before it and what happens after it. Every behavioral principle builds on this idea, from reinforcement and punishment to extinction and stimulus control. Understanding these principles gives you a framework for making sense of habits, learning, motivation, and behavioral change.

The ABC Model: How Behavior Works

The foundation of behavioral science is the three-term contingency, often called the ABC model. It breaks any behavior down into three connected parts: the antecedent (what happens right before), the behavior itself (the observable action), and the consequence (what happens right after). These three elements must be viewed together to understand why a behavior occurs.

The antecedent is anything that sets the stage. A phone buzzing, a teacher giving instructions, a feeling of hunger, a change in routine. Antecedents don’t cause behavior on their own, but they create the context in which behavior happens. The behavior is the actual action you can see and measure. Behavioral science focuses on what can be observed and counted rather than assumptions about intent or emotions. The consequence is what follows the behavior, and it determines whether that behavior is more or less likely to happen again.

This framework is the lens through which every other behavioral principle operates. Reinforcement, punishment, extinction, and stimulus control are all descriptions of specific relationships between these three components.

Two Types of Behavior

Behavioral science distinguishes between two broad categories. Respondent behavior is reflexive. It’s elicited automatically by something in the environment: you flinch at a loud noise, your mouth waters at the smell of food, your pupils constrict in bright light. These are stimulus-response relationships that don’t require learning, though they can be conditioned (as in Pavlov’s famous experiments with dogs).

Operant behavior is voluntary. It’s any behavior whose future frequency is determined by its history of consequences. If something good followed a behavior in the past, you’re more likely to do it again. If something unpleasant followed, you’re less likely. Operant behavior is selected, shaped, and maintained by consequences. Most of the principles below deal with operant behavior, because it’s the type most relevant to learning, habits, and intentional change.

Reinforcement: Increasing Behavior

Reinforcement is any consequence that increases the likelihood of a behavior being repeated. It comes in two forms, and the “positive” and “negative” labels don’t mean “good” and “bad.” They mean “adding” and “removing.”

Positive reinforcement involves providing something desirable after a behavior. A child completes homework and earns screen time. An employee hits a sales target and receives a bonus. Praise, attention, privileges, and tangible rewards all function as positive reinforcers when they make the behavior more likely to happen again. The key test is always whether the behavior actually increases in frequency. If you’re giving praise but the behavior isn’t changing, it’s not functioning as reinforcement for that person.

Negative reinforcement increases behavior by removing something unpleasant. You take an aspirin (behavior) and your headache goes away (removal of something aversive), making you more likely to reach for aspirin next time. A student answers a difficult question correctly, and the teacher stops calling on them repeatedly. The removal of discomfort or pressure is what strengthens the behavior.

Punishment: Decreasing Behavior

Punishment is the opposite of reinforcement. It’s any consequence that decreases the likelihood of a behavior being repeated. Again, “positive” means adding something and “negative” means taking something away.

Positive punishment adds an undesirable stimulus to decrease a behavior. A child touches a hot stove and feels pain. A driver speeds and receives a ticket. The added consequence makes the behavior less likely in the future. Negative punishment removes a pleasant stimulus to decrease a behavior. A teenager breaks curfew and loses car privileges. A child acts out and is removed from a fun activity. The loss of something valued is what reduces the behavior.

Punishment can suppress behavior quickly, but it has limitations. It tells a person what not to do without teaching them what to do instead, and it can produce side effects like avoidance, anxiety, or aggression. For this reason, behavioral approaches generally favor reinforcement-based strategies.

Extinction: When Consequences Stop

Extinction occurs when a behavior that was previously reinforced no longer produces that reinforcement. Over time, the behavior decreases and eventually stops. A child who used to get attention by whining finds that the adults around them no longer respond to whining. Without the reinforcing consequence, the behavior fades.

But there’s a predictable complication. When extinction first begins, you’ll often see what’s called an extinction burst: a sudden, temporary increase in the behavior. The person tries harder, does the behavior more intensely or more frequently, before it begins to decline. A child whose tantrums previously earned attention will likely throw bigger tantrums before giving up. Novel behaviors, emotional responses, or even aggression can appear during this phase. This burst is temporary, but it’s important to expect it, because giving in during an extinction burst teaches the person that escalation works.

Stimulus Control and Discrimination

Not all behavior happens everywhere, all the time. Stimulus control describes how specific environmental cues come to signal when a behavior will or won’t be reinforced. When a behavior occurs reliably in the presence of a particular stimulus and not in its absence, that behavior is under stimulus control.

This develops through discrimination training. A response is reinforced in the presence of one stimulus (called a discriminative stimulus) and not reinforced in the presence of another. Over time, the person learns to respond differently depending on the context. You answer your phone when it rings but not when it’s silent. You raise your hand in a classroom but not at a dinner table. The environmental cue tells you which behavior will pay off.

Generalization is the flip side. It happens when behavioral patterns established in one situation spread to similar situations. A child who learns to say “please” at home begins saying it at school and at friends’ houses. The behavior extends beyond the original training context to new environments and new people. Both discrimination and generalization are essential to flexible, adaptive behavior.

Motivating Operations: Why Timing Matters

The same consequence doesn’t always have the same effect. A glass of water is powerfully reinforcing when you’re dehydrated and nearly meaningless when you’ve just had three glasses. Motivating operations are the conditions that change how valuable a consequence is at any given moment.

An establishing operation increases the effectiveness of a reinforcer and increases behaviors associated with getting it. Going without food for several hours is an establishing operation that makes food more reinforcing and makes food-seeking behavior more likely. An abolishing operation does the opposite: it decreases a reinforcer’s effectiveness. Eating a large meal is an abolishing operation for food. You’re less motivated to seek it out, and behaviors that previously led to food drop off.

This principle explains why the same reward or consequence can work beautifully one day and fall flat the next. The internal state of the person, shaped by deprivation, satiation, or other contextual factors, changes what functions as an effective consequence.

Schedules of Reinforcement

In real life, behavior isn’t reinforced every single time it occurs. The pattern in which reinforcement is delivered, called the schedule of reinforcement, has a powerful effect on how quickly behavior is learned and how persistent it becomes.

There are four basic schedules. Fixed-ratio schedules reinforce after a set number of responses. A factory worker paid per unit produced, or a coffee shop punch card that gives a free drink after ten purchases, both operate on fixed-ratio schedules. These generate high, steady rates of behavior with a brief pause after each reinforcement. Variable-ratio schedules reinforce after an unpredictable number of responses. Slot machines are the classic example. Because you never know which pull will pay off, these schedules produce very high and persistent response rates, and the behavior is extremely resistant to extinction.

Fixed-interval schedules reinforce the first response after a set amount of time has passed. Checking the mailbox once a day, or studying more intensely as an exam approaches, reflects this pattern. Behavior tends to slow down right after reinforcement and then accelerate as the next opportunity approaches. Variable-interval schedules reinforce the first response after an unpredictable amount of time. Checking your email when notifications arrive at random intervals is a good example. These schedules produce steady, moderate rates of responding because there’s no way to predict when the next reinforcement will be available.

The practical takeaway is that behavior reinforced on an unpredictable schedule is much harder to extinguish than behavior reinforced every time. This is why some habits and behaviors feel so stubborn: they were built on intermittent reinforcement.

Putting the Principles Together

None of these principles operate in isolation. At any moment, a person’s behavior is influenced by the antecedent context, their current motivating operations, the history of consequences for that behavior, and the schedule on which those consequences have been delivered. A child acting out in a grocery store might be doing so because they’re tired and hungry (establishing operation), the store environment has historically been a context where tantrums produce candy (stimulus control and reinforcement history), and the candy has come intermittently (variable-ratio schedule), making the behavior highly resistant to change.

In 1968, researchers Baer, Wolf, and Risley outlined seven dimensions that any systematic application of these principles should meet: it should target socially meaningful behaviors, focus on observable and measurable actions, rely on evidence-based methods, describe procedures clearly enough for anyone to follow, use research-grounded techniques, produce practical improvements, and ensure that changes last across time and settings. These dimensions remain the standard for applying behavioral principles in education, therapy, workplace training, and everyday life.