The explanatory variable is the independent variable. These are two names for the same role in a study: the variable that is used to predict or explain differences in another variable. Its counterpart, the response variable, is also known as the dependent variable.
If you’re seeing both terms in a textbook or assignment and wondering whether they mean different things, they don’t. But the reason two labels exist is worth understanding, because it affects how you think about your data.
Why Two Names Exist
Statistics borrows vocabulary from different traditions. In experimental science, where a researcher directly manipulates one variable and measures the effect on another, the standard terms are “independent” and “dependent.” The independent variable is independent because the researcher sets its values freely. The dependent variable is dependent because its values depend on what happens to the independent variable.
In statistics and data analysis courses, the preferred pair is often “explanatory” and “response.” These terms are more descriptive of what you’re actually doing with the data: using one variable to explain variation in another. A third common pair, especially in regression and machine learning, is “predictor” and “outcome.” All three pairs describe the same relationship.
Where Each Term Fits on a Graph
Regardless of which name you use, the explanatory (independent, predictor) variable goes on the x-axis of a scatterplot. The response (dependent, outcome) variable goes on the y-axis. In a regression equation like y = mx + b, x is the explanatory variable and y is the response variable.
A quick way to remember: the explanatory variable is the one you think comes first, either in time or in logic. If you’re studying whether hours of exercise per week predicts resting heart rate, exercise hours is the explanatory variable (x-axis) and resting heart rate is the response variable (y-axis).
When “Explanatory” Is a Better Choice Than “Independent”
Although the terms are interchangeable, there’s a reason many statistics instructors lean toward “explanatory.” The word “independent” can imply that the researcher controlled the variable, which is true in a randomized experiment but not in an observational study. If you’re analyzing survey data on income and education level, nobody assigned people their education. Calling education the “independent variable” could suggest more control than actually existed. Calling it the “explanatory variable” is more neutral: it simply means you’re using education to explain patterns in income, without claiming you manipulated it.
This distinction matters because it connects to a deeper issue: causation. In a true experiment, where the researcher randomly assigns participants to different levels of the independent variable, you can make causal claims. In an observational study, you can identify associations but not prove one variable causes changes in another. Confounding, where an unmeasured third factor influences both variables, is always a possibility in observational data. Using the term “explanatory” serves as a subtle reminder that explaining variation is not the same as proving a cause.
How to Identify the Explanatory Variable
In any study or homework problem, ask yourself: which variable is being used to predict or explain, and which variable is the outcome being measured? A few examples make this concrete:
- Drug dosage and symptom relief. Dosage is the explanatory variable. Symptom relief is the response variable. In a clinical trial, the researcher controls dosage directly, so “independent variable” fits perfectly here.
- Hours of sleep and test scores. Sleep is the explanatory variable. Test scores are the response. If this is survey data rather than an experiment, “explanatory” is the more precise term.
- Smoking and lung capacity. Smoking status is explanatory. Lung capacity is the response. Since researchers can’t ethically assign people to smoke, this is observational, and calling smoking the “explanatory variable” avoids overstating the study design.
- Fertilizer amount and crop yield. Fertilizer is explanatory. Yield is the response. If a farmer randomly applies different amounts to different plots, this is experimental, and “independent variable” works just as well.
The explanatory variable doesn’t have to cause the response. It just needs to be the one you’re using to explain or predict it.
A Quick Reference for All the Synonyms
Because different fields and textbooks favor different terminology, here’s how the pairs line up:
- Explanatory variable = independent variable = predictor variable (x-axis)
- Response variable = dependent variable = outcome variable (y-axis)
If your course uses one pair, your textbook uses another, and an online resource uses a third, they’re all describing the same two roles. The explanatory variable is always the independent variable, never the dependent one.

