Why Are Number 2 Pencils the Standard for Tests?

Number 2 pencils became the standard for tests because their graphite cores strike the ideal balance between being dark enough for machines to read and hard enough to erase cleanly. This wasn’t an arbitrary choice. It traces back to the 1930s, when the first automated scoring machines relied on the electrical properties of graphite to detect answers, and the number 2 grade turned out to be the sweet spot.

How Graphite Hardness Works

Pencils are graded on a scale from hard (light marks) to soft (dark marks). In the U.S. numbering system, a number 1 pencil is the hardest, leaving faint, thin lines. Higher numbers are softer, depositing more graphite and creating darker marks. The number 2 pencil sits right in the middle, which is why it’s equivalent to “HB” on the international grading scale used by most pencil manufacturers outside the United States. H stands for hard, B stands for black, and HB means you get a bit of both.

A softer pencil (number 3 or 4) would leave darker, thicker marks, but it would also smudge easily and be harder to erase completely. A harder pencil (number 1) would stay cleaner but might not leave enough material on the page for a machine to pick up reliably. The number 2 grade deposits enough graphite to be clearly detected while still erasing without leaving residue that could confuse a scanner.

The Machines That Started It All

The connection between pencils and test scoring goes back to IBM’s 805 Test Scoring Machine, one of the earliest devices built to grade answer sheets automatically. The 805 was an electrical machine that used tiny circuits responding to the conductivity of graphite to detect pencil marks. Graphite is a form of carbon that conducts electricity, so when a student filled in a bubble, the graphite completed a small circuit on the answer sheet.

Getting this to work reliably was harder than it sounds. Early prototypes by an IBM engineer named Reynold Wood struggled because the results were erratic. When students pressed lightly, the pencil marks conducted less electricity, and the machine became less accurate. The lighter the mark, the less reliable the score. A later design by Benjamin Wood (no relation) solved this by introducing high-resistance components into the circuits, raising the total resistance so that small variations in how hard someone pressed no longer threw off the reading. The finished machine used a contact plate with 750 small circular circuits, each corresponding to an answer position on the card.

The number 2 pencil became the default because it deposited enough conductive graphite to trigger these circuits consistently, even with normal variation in how hard different people press. Softer pencils would have worked too, but their smudging and erasing problems made them impractical for a test where students constantly change answers.

From Electrical Sensing to Optical Scanning

The technology shifted in 1972 when Scantron was founded in Los Angeles by William E. Sanders and Michael Sokolski, who invented optical mark recognition (OMR) scanners. Instead of measuring electrical conductivity, these machines shine a light on the answer sheet and measure how much light is reflected or absorbed. Graphite absorbs light effectively, so a filled-in bubble appears dark to the sensor while the blank paper around it reflects light back.

This shift from electrical to optical sensing could have loosened the pencil requirement, since any dark mark would theoretically work. But the number 2 pencil remained the standard for practical reasons. It still provides the best combination of reliable darkness, clean erasing, and consistent performance across millions of test-takers. Pen ink, for instance, is dark enough for scanners but impossible to erase, which is a problem when a machine might read a crossed-out answer as a response. Mechanical pencils often use thinner, harder lead that doesn’t fill bubbles as completely.

Why the Requirement Still Exists Today

Even as standardized testing moves increasingly to digital formats, paper-based exams still specify number 2 pencils. The ACT’s current test day instructions tell students to bring sharpened No. 2 pencils with erasers, explicitly ruling out pens and mechanical pencils. The College Board has historically maintained the same requirement for the SAT.

The reason hasn’t changed much in 90 years. Optical scanners process millions of answer sheets, and consistency matters enormously at that scale. A single grade of pencil that everyone uses eliminates a variable. If students showed up with pencils of different hardnesses, some marks would be too faint, others would smudge across adjacent bubbles during handling, and error rates would climb. The number 2 pencil, by landing in the middle of the hardness spectrum, minimizes all of these failure modes at once. It’s less about the pencil being special and more about it being reliably average in exactly the right way.