What Is an Outlier in a Box Plot? The 1.5×IQR Rule

An outlier in a box plot is a data point that falls unusually far from the rest of the data, plotted as an individual dot beyond the plot’s whiskers. The standard rule flags any value more than 1.5 times the interquartile range (IQR) below the first quartile or above the third quartile. This method, built into virtually every statistics tool, gives you a consistent, visual way to spot values that don’t fit the overall pattern.

How the 1.5×IQR Rule Works

A box plot divides your data into four quarters. The box itself spans the middle 50% of values, from the first quartile (Q1) at the bottom edge to the third quartile (Q3) at the top edge. The IQR is simply Q3 minus Q1, representing the spread of that middle half.

To determine which points count as outliers, you calculate two boundaries called “fences”:

  • Lower fence: Q1 − 1.5 × IQR
  • Upper fence: Q3 + 1.5 × IQR

Any data point that falls below the lower fence or above the upper fence is marked as an outlier. For example, if Q1 is 80 and Q3 is 90, the IQR is 10. Multiply by 1.5 to get 15. The lower fence lands at 65 and the upper fence at 105. A score of 110 would be flagged; a score of 60 would be flagged. Anything between 65 and 105 stays inside the normal range.

Where the Whiskers End

This is the detail that trips people up. The whiskers on a box plot do not automatically extend to the minimum and maximum values in the data set. When outliers exist, each whisker extends only to the farthest data point that is still inside the fence. The outliers then appear as isolated dots beyond the whiskers.

So if your data set has a maximum value of 110 but the upper fence is 105, the upper whisker stops at whatever the highest non-outlier value is (say, 103), and the 110 gets plotted as a lone dot above it. This is why box plots are so useful for spotting unusual values at a glance: the dots literally stand apart from everything else.

Mild vs. Extreme Outliers

Not all outliers are equally unusual. NIST (the National Institute of Standards and Technology) distinguishes between two levels:

  • Mild outliers: Points beyond 1.5 × IQR from the box edges but within 3 × IQR.
  • Extreme outliers: Points beyond 3 × IQR from the box edges.

Using the same example where the IQR is 10, a mild outlier on the upper end would fall between 105 and 120. An extreme outlier would be anything above 120. The distinction matters because extreme outliers are far more likely to represent errors, truly rare events, or a fundamentally different process generating that data point. Some box plot tools use different symbols (open circles vs. filled dots, or asterisks) to separate the two categories, though not all do.

Why 1.5 Specifically

The 1.5 multiplier comes from the statistician John Tukey, who introduced it in the 1970s as a practical threshold. It’s not derived from a strict mathematical proof. Instead, it strikes a balance: strict enough to catch genuinely unusual values, lenient enough to avoid flagging ordinary variation. In a perfectly bell-shaped (normal) distribution, roughly 0.7% of data points would fall outside the 1.5 × IQR fences, so you’d expect about 7 outlier flags per 1,000 observations even when nothing unusual is happening.

This is the default in R, Python’s matplotlib, and most other statistical software. Changing it is possible but generally discouraged because people expect box plot whiskers to represent 1.5 × IQR. Using a different multiplier without labeling it clearly can lead readers to draw wrong conclusions from your chart.

What Outliers Actually Tell You

Being flagged as an outlier doesn’t automatically mean a data point is wrong or should be removed. An outlier is a signal worth investigating, not a verdict. There are three common explanations for why a point ends up outside the fences.

First, it could be a data entry or measurement error. A decimal in the wrong place or a sensor glitch can produce a value that looks extreme but is simply incorrect. These are the easiest cases: once confirmed as errors, they should be corrected or removed.

Second, the value might be real but rare. In salary data, for instance, a CEO’s compensation will almost always show up as an outlier compared to the rest of the company. That’s not a mistake. It’s a genuine feature of the distribution, and removing it would misrepresent the data.

Third, the outlier might signal that your data comes from more than one group. If you’re plotting heights and accidentally include both children and adults, you’ll likely see outliers that are actually just a separate population mixed into the same data set. In this case, the fix isn’t removing outliers but splitting the data into meaningful subgroups.

A Worked Example

Suppose you collected test scores from 15 students: 42, 55, 62, 65, 68, 70, 72, 74, 76, 78, 80, 82, 85, 88, 99. The first quartile (Q1) is 65, the third quartile (Q3) is 83.5, and the IQR is 18.5. Multiply 18.5 by 1.5 to get 27.75.

The lower fence is 65 − 27.75 = 37.25, and the upper fence is 83.5 + 27.75 = 111.25. Every score in this data set falls within those fences, so there are no outliers. The whiskers extend to 42 on the low end and 99 on the high end. Now imagine one student scored 12 instead of 42. That score falls below 37.25, so it would appear as a dot below the lower whisker, and the whisker itself would stop at the next lowest non-outlier value (55).

Walking through the math once like this makes it much easier to read any box plot you encounter. The dots aren’t decorations. Each one is a specific data point that cleared a precise numerical threshold, and knowing how that threshold is calculated lets you judge just how unusual the value really is.