What Is Manual Muscle Testing? Uses, Grading, and Limits

Manual muscle testing (MMT) is a clinical exam where a trained practitioner evaluates the strength of individual muscle groups by applying resistance with their hands and grading the response on a standardized scale, typically from 0 to 5. It requires no equipment, takes only minutes per muscle group, and is one of the most widely used strength assessments in physical therapy, rehabilitation medicine, and neurology.

How the Test Works

The basic setup is straightforward. You’re positioned so a specific muscle group can be isolated, and the examiner uses one hand to apply resistance (or feel for a contraction) while the other hand stabilizes the limb being tested. Each muscle group is tested on both sides of the body so the examiner can compare left to right. The goal is to determine how much force the muscle can produce and whether it can hold a position against gravity and external pressure.

There are two main testing techniques. In a “break test,” you hold a position while the examiner gradually increases pressure, trying to overcome your muscle. If the muscle gives way, the examiner notes the point at which it “breaks.” In a “make test,” the examiner holds completely still while you push as hard as you can against their hand. Break tests tend to produce higher peak force readings because they involve the muscle resisting while lengthening, while make tests measure pure pushing strength against a fixed point. The make test requires the examiner to be stronger than the person being tested, which can be a practical limitation.

The 0 to 5 Grading Scale

Most clinicians use a version of the Oxford scale, which assigns a grade from 0 (no function) to 5 (full strength). Here’s what each grade means in practice:

Grade 0 (Zero): No contraction at all. The examiner can’t feel any muscle activity.
Grade 1 (Trace): A faint contraction is felt in the muscle or its tendon, but the limb doesn’t move.
Grade 2 (Poor): The muscle can move the limb through its full range, but only when gravity is taken out of the equation (for example, sliding the arm across a table rather than lifting it).
Grade 3 (Fair): The muscle can move the limb against gravity but can’t hold the position when the examiner adds pressure.
Grade 4 (Good): The muscle holds its position against moderate pressure from the examiner.
Grade 5 (Normal): The muscle holds its position against strong pressure, matching what’s expected for someone of that age and build.

The distinction between grades 2 and 3 is especially important because it marks the threshold where a muscle can work against gravity. Below grade 3, the examiner changes the patient’s position to remove gravity from the equation, testing the muscle in a horizontal plane instead. Some grading systems add plus and minus designations (like 4+ or 3−) to capture finer differences. The Daniels and Worthingham system, for instance, uses 8 intervals with these modifiers, giving clinicians more precision than the basic 6-point scale.

What It’s Used For

MMT serves several clinical purposes. In rehabilitation, it tracks recovery over time. After a stroke, spinal cord injury, or prolonged ICU stay, regular muscle testing shows whether strength is returning and which muscle groups are lagging behind. In neurology, patterns of weakness across specific muscle groups can help pinpoint which nerve or nerve root is affected. A physical therapist might also use it before designing an exercise program to identify which muscles need the most work.

It’s particularly valuable for patients with significant weakness (grades 0 through 3), where detecting any muscle activity at all is clinically meaningful. Research has demonstrated it’s a reliable method even in critically ill patients, including those recovering from acute respiratory distress syndrome, as long as the patient can sit upright in bed and follow simple two-step commands.

Two Major Testing Approaches

Two textbook methods dominate clinical practice, and they don’t always agree. The Daniels and Worthingham method incorporates functional tasks into grading. For calf strength, for example, it requires a single-leg heel raise for grades 3 through 5. If you can’t perform that task, the highest grade you can receive is 2+, and the examiner switches to a hands-on break test. This approach tends to spread scores across a wider range, making it easier to detect meaningful differences between patients.

The Kendall method relies more heavily on manual resistance applied by the examiner and uses a 10-point scale. While it’s well-established, research has found it can produce a ceiling effect in stronger patients. In one study comparing the two methods for ankle strength, 100% of patients who could do a single-leg heel raise and 58% of those who couldn’t both scored the maximum Kendall grade. The Daniels and Worthingham approach produced a wider spread of scores, suggesting it may be more sensitive to real differences in strength.

How Reliable Are the Results?

When the same trained examiner tests the same patient on two separate occasions, MMT generally produces consistent results. Studies have found high correlation coefficients for most muscle groups when a single tester repeats the assessment. The reliability is strongest in weaker muscles (grades 0 through 3), where the differences between grades are clear-cut: either the muscle contracts or it doesn’t, either the limb moves against gravity or it can’t.

Reliability becomes more problematic at higher grades. The difference between grade 4 and grade 5 depends on the examiner’s subjective judgment of “moderate” versus “strong” pressure, and two different examiners may not agree. The examiner’s own strength also matters. A smaller clinician testing a large, strong patient may perceive resistance differently than a larger clinician would.

Where MMT Falls Short

The biggest limitation is its lack of precision in the upper strength range. A grade 4 (“good”) can represent an enormous span of actual force production. Someone who is mildly weak and someone who is substantially weak might both receive a grade 4 because the scale simply doesn’t have enough resolution at the top end. In one study of patients with knee osteoarthritis, handheld dynamometry measurements showed clear extensor weakness, while MMT grades indicated “good” strength. The correlation between the two methods for knee extensors was only 0.24, meaning MMT was missing real deficits that an instrument could detect.

Handheld dynamometry, which uses a small device to measure force in pounds or kilograms, is less subjective than MMT at stronger grades and can detect smaller changes over time. For this reason, clinicians sometimes use MMT as a screening tool and switch to dynamometry when they need to track subtle strength changes in patients who already score in the 4 to 5 range.

MMT also can’t distinguish between true muscle weakness and pain-limited weakness. If contracting a muscle hurts, you’ll produce less force regardless of how strong the muscle actually is. Experienced examiners note when pain appears to limit the test, but this remains a judgment call. The technique also depends on the patient’s cooperation and effort, which makes it less useful for patients who can’t understand instructions or who aren’t fully engaged in the exam.