What Is Sonority? Meaning, Hierarchy, and Sound

Sonority is the amount of acoustic energy a speech sound carries. In simpler terms, it describes how “loud” or “open” a sound naturally is when you produce it, though it’s not purely about volume. Vowels like “ah” are highly sonorous because your mouth is wide open and your vocal folds are vibrating freely. Consonants like “p” or “k” are much less sonorous because your lips or tongue block the airflow. This property is fundamental to how every spoken language on Earth organizes sounds into syllables and words.

What Makes a Sound More Sonorous

Two physical factors determine a sound’s sonority: airflow from the lungs and vibration of the vocal folds. When both are at their maximum, you get the most sonorous sounds. When one or both are reduced, sonority drops.

Vowels sit at the top of the sonority scale because your vocal tract stays open while your vocal folds vibrate. Nothing blocks the air, so a large wave of acoustic energy leaves your mouth. Consonants fall lower on the scale because your tongue, lips, or teeth partially or fully obstruct the airflow, cutting the energy output. There’s also an intermediate category called glides (the “y” sound in “yes” or the “w” in “wet”) that behave like vowels in some ways since the vocal tract is unobstructed, but they’re shorter and carry less energy.

The Sonority Hierarchy

Linguists rank all speech sounds on a scale from least to most sonorous. This ranking, called the sonority hierarchy, is consistent across all known languages. From lowest to highest sonority, it looks like this:

Voiceless stops (p, t, k): the least sonorous, because airflow is completely blocked and the vocal folds don’t vibrate
Voiced stops (b, d, g): slightly more sonorous because the vocal folds vibrate, but airflow is still fully blocked
Voiceless fricatives (f, s, sh): air is forced through a narrow gap, creating turbulence but no vocal fold vibration
Voiced fricatives (v, z): same narrow gap, but with vocal fold vibration added
Nasals (m, n, ng): air flows freely through the nose while the vocal folds vibrate
Liquids (r, l): the tongue partially shapes the airflow but doesn’t create much obstruction
Glides (w, y): the vocal tract is essentially open, like a vowel, but the sound is brief
High vowels (ee, oo): sonorous, though the tongue is relatively high in the mouth
Mid vowels (eh, oh): more open, more energy
Low vowels (ah): the most sonorous sounds in any language, with the mouth wide open and vocal folds vibrating freely

The broad pattern is straightforward: the more open your mouth and the more your vocal folds participate, the higher the sonority.

How Sonority Shapes Syllables

The reason linguists care about sonority isn’t just classification. It’s the organizing principle behind syllables. A syllable is, at its core, a peak of sonority surrounded by less sonorous sounds. The most sonorous sound in the syllable (almost always a vowel) is called the nucleus, and the less sonorous sounds cluster around it at the edges.

This pattern is governed by a rule called the Sonority Sequencing Principle. It states that sonority must rise from the beginning of a syllable toward the nucleus, then fall after it. Think of it as an arch shape: consonants at the margins, a vowel at the peak. The word “plant,” for example, starts with the low-sonority “p,” rises through “l” (a liquid, more sonorous), reaches the peak at “a” (a vowel), then falls through “n” (a nasal) to “t” (a stop). That rising and falling pattern is exactly what the principle predicts.

This isn’t just an English rule. It operates in every spoken language, which is part of what makes sonority such a powerful concept in linguistics. It explains why certain combinations of sounds feel natural and pronounceable while others feel impossible.

Why Some Sound Combinations Work and Others Don’t

Sonority also explains which consonant clusters a language allows. When two consonants appear together at the start of a word, the second one generally needs to be more sonorous than the first, so that sonority is still rising toward the vowel. But languages differ in how big a sonority gap they require between the two consonants.

English, for example, requires a minimum sonority distance of two steps on the hierarchy. This is why “fly” works: “f” is a fricative (low sonority) and “l” is a liquid (much higher), giving a comfortable gap of two steps. But no English word starts with “fn,” because “f” (a fricative) and “n” (a nasal) are only one step apart on the scale. The cluster “kw” in “quick” also works easily, since the gap between a stop and a glide is three steps, well above the minimum. Other languages set their own thresholds, which is why Russian or Georgian speakers can pronounce clusters that sound impossible to English ears.

The Odd Case of S-Clusters

If you’ve been thinking about words like “stop,” “skate,” or “spin,” you’ve spotted a real problem. In these words, the “s” comes before a stop consonant (t, k, p), which means sonority actually decreases from the first sound to the second. That’s a direct violation of the Sonority Sequencing Principle.

These sibilant-stop clusters are among the most prominent exceptions to the rule, and they appear in many languages, not just English. German has the same pattern with clusters like “sht” and “shp.” Linguists have debated for decades how to handle these. Some argue that the “s” in these clusters occupies a special position outside the normal syllable structure, essentially treating it as an appendix rather than a true onset consonant. Others have proposed that sibilants simply play by different rules. Either way, these clusters are a well-known wrinkle in an otherwise remarkably consistent pattern.

Why Sonority Matters Beyond Linguistics

Sonority isn’t just an abstract classification system. It shapes how clearly speech travels and how easily listeners can pick words apart. Your auditory system separates incoming sounds partly by mapping their frequency content along the cochlea, the spiral structure in the inner ear. Sounds with very different acoustic profiles activate different populations of nerve fibers, making them easier to distinguish. Because high-sonority sounds (vowels) and low-sonority sounds (stops) have such different energy profiles, the alternation between them creates a natural rhythm that helps your brain segment continuous speech into individual syllables and words.

This is also why vowels tend to form the backbone of words you can hear across a noisy room. They carry more acoustic energy, travel farther, and persist longer than consonants. The sonority peaks in a sentence are essentially the scaffolding your brain uses to reconstruct what someone said, even when background noise swallows some of the quieter consonants.