What Is IQ Testing? Tests, Scores, and Limitations

IQ testing is a standardized method of measuring cognitive ability, producing a score that represents how a person’s intellectual functioning compares to the general population. The average score is 100, and about two-thirds of people score between 85 and 115. These tests have been used for over a century, originally developed in the early 1900s by Alfred Binet to identify children in Paris who needed extra academic support. Today they serve a much wider range of purposes, from diagnosing learning disabilities to qualifying students for gifted programs.

What IQ Tests Actually Measure

IQ tests don’t measure a single ability. Modern tests break intelligence into several distinct cognitive skills and score each one separately. The most widely used test for children, the Wechsler Intelligence Scale for Children (now in its fifth edition), measures five primary areas: verbal comprehension (how well you understand and use language), visual spatial ability (how well you analyze visual information), fluid reasoning (your ability to solve novel problems and recognize patterns), working memory (how much information you can hold and manipulate in your mind at once), and processing speed (how quickly you can perform simple cognitive tasks accurately).

Each of these areas produces its own score, and they combine into an overall number called the Full Scale IQ. This structure matters because two people can have the same overall score while showing very different cognitive profiles. One person might excel at verbal tasks but struggle with processing speed, while another shows the opposite pattern. Psychologists look at these patterns, not just the single number, to understand a person’s strengths and weaknesses.

The theoretical framework behind modern IQ tests is called Cattell-Horn-Carroll theory, a synthesis of decades of research on intelligence. It proposes that intelligence operates on multiple levels: broad abilities like fluid reasoning and crystallized knowledge sit beneath a general intellectual factor, and narrow, specific skills sit beneath those. Cross-battery research has confirmed that this framework holds up across multiple major intelligence tests, which is why most current tests are designed around it.

How Scores Work

IQ scores follow a bell curve distribution. The test is designed so that 100 is always the average, with a standard deviation of 15 points. That means roughly 68% of people score between 85 and 115, and about 95% fall between 70 and 130.

The extremes of the curve carry clinical significance. A score of 130 or higher is considered superior and places a person in approximately the top 2% of the population. This is the threshold many states use when identifying students for gifted education programs, though most guidelines specify that an IQ score alone isn’t sufficient for placement. Pennsylvania’s guidelines, for example, require multiple criteria indicating gifted ability even when a student scores 130 or above.

At the other end, a score of 70 or below (the bottom 2%) can indicate an intellectual disability, but only when paired with significant difficulties in everyday functioning. Things like managing personal care, navigating social situations, and meeting age-appropriate responsibilities all factor into the diagnosis. A low score on its own doesn’t determine the outcome.

Mensa, the well-known high-IQ society, admits people who score in the top 2% on an approved standardized test. In practice, that means a score of 132 or higher on the Stanford-Binet test, or 148 or higher on certain other scales that use a different scoring system.

The Major Tests and Who Takes Them

Several IQ tests exist, but the Wechsler scales dominate clinical practice. David Wechsler created his first test in 1939 specifically for adults because the existing Stanford-Binet test had been designed for children and relied too heavily on verbal ability. Wechsler wanted a test that also measured nonverbal, performance-based skills. His approach proved so successful that it expanded into a family of tests: the WAIS for adults (now in its fifth edition in the U.S.), the WISC for school-age children, and the WPPSI for preschoolers.

The Stanford-Binet, now also in its fifth edition, remains widely used and covers all age groups from early childhood through adulthood. Both test families produce comparable Full Scale IQ scores, though their internal structures differ slightly. The newest WAIS-V calculates its Full Scale IQ from seven subtests covering vocabulary, similarities, block design, matrix reasoning, quantitative reasoning, digit sequencing, and coding. It also introduced a dedicated Fluid Reasoning index, reflecting growing recognition of how important novel problem-solving ability is to understanding intelligence.

What IQ Testing Is Used For

The most common use is in educational settings. Schools use IQ tests to help identify children who may have learning disabilities, intellectual disabilities, or giftedness. When a child is struggling academically, an IQ test can reveal whether the difficulty stems from a specific cognitive weakness (like poor working memory) rather than an overall lack of ability. That distinction shapes the kind of support the child receives. For students who qualify as both gifted and eligible for special education, a single individualized plan can be developed to address both sets of needs.

Clinicians also use IQ testing as part of broader psychological evaluations. During the testing session, psychologists observe not just the scores but how the person approaches problems, handles frustration, and sustains attention. These behavioral observations, combined with the pattern of scores across different cognitive areas, help build a more complete picture than any single number could provide.

Outside of clinical and educational settings, cognitive aptitude tests derived from IQ research are used in military placement, vocational assessment, and some employment contexts to match people with roles that fit their skill profiles.

What the Test Experience Looks Like

A professionally administered IQ test is a one-on-one session with a trained psychologist or psychometrist. You sit across from the examiner and work through a series of tasks: defining words, solving visual puzzles, repeating number sequences, assembling block patterns, and completing timed symbol-matching exercises, among others. The full session typically takes two to three hours, though some tests can be shorter depending on the version and purpose.

If you’re pursuing private testing (not through a school or insurance-covered referral), expect to pay between $500 and $2,000. That cost generally covers the testing session itself, scoring, interpretation, and a detailed written report. Prices vary by location and provider. School-based testing for children suspected of having a disability is typically provided at no cost to families through the public school system.

Known Limitations and Biases

IQ tests are imperfect tools, and their limitations are well documented. One of the most significant concerns is cultural bias. Most major tests were normed primarily on majority-group populations, which can make scores less valid for people from different cultural or linguistic backgrounds. If your cultural or linguistic background wasn’t well represented in the group used to establish the test’s norms, your score may not accurately reflect your ability.

Language creates particularly tricky problems. For non-native English speakers, a timed test administered in English can underestimate ability simply because processing a second language takes longer. Even within the same language, words carry different meanings across cultures. The Spanish word “educación” emphasizes social skills and respectful behavior, while the English “education” focuses on cognitive learning. These subtle differences can shift how someone interprets and responds to test questions.

Cultural reasoning styles also affect performance. Research comparing Western and Liberian participants found that when asked to sort objects into groups, Westerners categorized by type (food, tools), while Liberian participants paired items by practical use (a knife with a potato, because you use one to cut the other). Both approaches reflect logical thinking, but only one matches what the test expects. Similarly, Native American students asked “Who is the son of your aunt?” consistently answered “brother” because their kinship system considers all relatives of the same generation to be siblings. The “correct” test answer of “cousin” reflects a specific cultural framework, not superior reasoning.

These issues don’t make IQ tests useless, but they do mean scores should be interpreted carefully, especially for people from underrepresented backgrounds. A skilled evaluator accounts for language proficiency, cultural context, and testing conditions when making sense of results, rather than treating the number as a fixed, objective measure of someone’s potential.