When to Use Geometric Mean vs. Arithmetic Mean

Use the geometric mean whenever your data points multiply together rather than add together. That single principle covers most situations: compounding investment returns, averaging growth rates, combining ratios, and summarizing data that spans several orders of magnitude. The arithmetic mean (the simple average most people default to) can mislead you in all of these cases, sometimes dramatically.

The Core Principle: Multiplying vs. Adding

The arithmetic mean works when quantities add together to produce a total. If five people carry boxes and you want to know the average load, you add up all the boxes and divide by five. The geometric mean works when quantities multiply together to produce a product. If you want to find the “average” factor among several factors that combine multiplicatively, the geometric mean gives you the right answer.

To calculate it, you multiply all your values together, then take the nth root (where n is the number of values). For two numbers, that’s the square root of their product. For five numbers, the fifth root. This matters because many real-world processes are multiplicative: growth rates compound on each other, ratios scale against each other, and concentrations in biological systems span ranges where a simple average gets pulled toward extreme values.

Investment Returns and Growth Rates

This is the most common real-world application. Say you invest $100 in a stock that returns 10%, then 150%, then loses 30%, then gains 10%. If you calculate the arithmetic mean, you get 35% average annual return. But the geometric mean, which accounts for the way each year’s return compounds on the previous balance, gives you 20.6%. That’s a massive difference, and only the geometric mean reflects what actually happened to your money.

Here’s a clearer example. Imagine $100 growing over five years with returns of 3%, 5%, 8%, -1%, and 10%. Your balance moves like this: $103, then $108.15, then $116.80, then $115.63, then $127.20. The geometric mean return is 4.93% per year, meaning if your investment had grown at exactly 4.93% every year, you’d end up at the same place. The arithmetic mean would overstate your actual performance because it ignores the compounding effect, where gains and losses build on each other.

This geometric mean return is what financial professionals call the compound annual growth rate (CAGR). Whenever you see investment performance reported as an annualized return, it should be calculated this way. If someone presents you with average return figures, it’s worth checking whether they used the arithmetic or geometric mean. The arithmetic version will always be equal to or higher than the geometric version, and for volatile investments, the gap can be significant. The two averages only converge when returns are very stable from year to year.

Data That Spans Orders of Magnitude

Some datasets contain values that range from very small to very large. Bacterial counts in water, antibody concentrations in blood, city populations, income distributions: these often follow what statisticians call a log-normal distribution, where the logarithm of each value is normally distributed. When your data looks like this, the geometric mean captures the center of the distribution far more accurately than the arithmetic mean, which gets dragged upward by a few extreme values.

The U.S. Environmental Protection Agency and the Food and Drug Administration both use geometric means when evaluating bacterial contamination in water. Concentrations of fecal indicator bacteria like E. coli can vary enormously from one sample to the next, so a single high reading would skew a simple average and give a misleading picture of water quality. The geometric mean dampens the effect of those outliers while still reflecting the overall contamination level. This same logic applies in biomedical research, where immune markers, drug concentrations, and cell counts routinely span several orders of magnitude.

Averaging Ratios and Percentages

Ratios are inherently multiplicative, which makes the geometric mean the right tool for averaging them. Consider a simple example: if a stock doubles one year (a ratio of 2.0) and then halves the next (a ratio of 0.5), you’re back where you started. The arithmetic mean of 2.0 and 0.5 is 1.25, which falsely suggests growth. The geometric mean is 1.0, correctly indicating no change.

This principle extends to any situation where you’re averaging proportions, growth factors, or rates of change. Price-to-earnings ratios across companies, speed ratios in mechanical systems, relative risk values in medical studies: all of these are better summarized with a geometric mean. The California State Water Resources Control Board specifically recommends geometric means for datasets “bounded by zero” and covering “several orders of magnitude,” which describes most ratio-based measurements.

Composite Indices Like the HDI

The United Nations Development Programme switched to a geometric mean for the Human Development Index in 2010, and the reason is instructive. The HDI combines three dimensions: health, education, and income. Under the old formula, which used a simple arithmetic average, a country could score poorly in one dimension but fully compensate with high performance in another. A nation with terrible health outcomes but high GDP could still rank well.

The geometric mean changed that. Poor performance in any single dimension now pulls the overall score down more sharply, because you’re multiplying the components rather than adding them. As the UNDP noted, there is “no longer perfect substitutability across dimensions.” The geometric mean captures how well-rounded a country’s development is, rather than allowing one strong area to mask a weak one. This same logic applies whenever you’re building a composite score from components that shouldn’t fully substitute for each other.

When Not to Use It

The geometric mean has clear mathematical constraints. You can only take it of positive numbers in standard practice. If your dataset contains a zero, the entire product becomes zero, and the geometric mean collapses. If it contains negative numbers, the calculation can produce undefined or complex results. While mathematical workarounds exist for these situations, they require careful handling and are not standard.

You also don’t need a geometric mean when your data is genuinely additive. If you’re averaging test scores, heights, temperatures in Celsius, or any measurement where values combine by addition, the arithmetic mean is correct. The geometric mean isn’t inherently “better.” It’s the right tool for multiplicative relationships and the wrong tool for additive ones.

A useful test: ask yourself whether the process that generated your data involves compounding, scaling, or multiplying. If a 50% increase followed by a 50% decrease doesn’t bring you back to where you started (and it doesn’t, since you’d end up at 75% of your original value), you’re dealing with a multiplicative process and the geometric mean is your answer.

Quick Reference for Common Use Cases

  • Investment returns over multiple years: geometric mean gives the true annualized return
  • Population or revenue growth rates: geometric mean finds the equivalent constant rate
  • Bacterial or chemical concentrations: geometric mean resists distortion from extreme samples
  • Averaging ratios or relative values: geometric mean treats increases and decreases symmetrically
  • Composite scores from unlike dimensions: geometric mean penalizes imbalance across components
  • Any log-normally distributed data: geometric mean locates the true center of the distribution