What Is Phonology? The Study of Speech Sounds

Phonology is the study of how sounds are organized and patterned within a language. While it’s closely related to phonetics, which examines how speech sounds are physically produced and heard, phonology focuses on the abstract system behind those sounds: the rules, patterns, and structures that determine which sounds matter in a language, how they combine, and how they change in different contexts. Think of phonetics as studying the raw sounds themselves, and phonology as studying the blueprint a language uses to organize them.

Phonemes and Allophones

The basic unit of phonology is the phoneme, which is a sound that can change the meaning of a word. English has roughly 44 phonemes, depending on the dialect. You can identify a phoneme by finding what linguists call a “minimal pair,” two words that differ by only one sound. The words “mat” and “cat,” for example, are identical except for their first sound, which proves that /m/ and /k/ are separate phonemes in English.

But here’s where phonology gets interesting. Each phoneme can actually be pronounced in slightly different ways depending on where it appears in a word or sentence. These variants are called allophones. The “p” in “pin” has a little puff of air after it, while the “p” in “spin” doesn’t. Native English speakers rarely notice this difference because both versions function as the same phoneme. In another language, though, that puff of air might distinguish two completely different words. Two sounds qualify as allophones of the same phoneme when they’re phonetically similar to each other and never appear in the same position within a word. They essentially take turns: one shows up in certain environments, the other shows up elsewhere.

This distinction between phonemes and allophones is central to phonology. It reveals that what “counts” as a meaningful sound isn’t universal. It’s determined by the rules of each language’s sound system.

Syllable Structure

Phonology also examines how sounds group together into syllables. Every syllable has a nucleus, which is its most prominent, resonant part, almost always a vowel. Consonants that come before the nucleus form the onset, and any consonants after it form the coda. The nucleus and coda together make up the rhyme, which is why “cat” and “bat” rhyme: they share the same nucleus and coda.

The only part of a syllable that absolutely must be present is the nucleus. A word like “I” is a single nucleus with no onset or coda. But when consonants do cluster together at the beginning or end of a syllable, languages have strict rules about which combinations are allowed. English permits “str” at the start of a syllable (as in “string”) but never “mbl.” These constraints, called phonotactics, vary from language to language, which is one reason foreign words can feel so difficult to pronounce.

When a word has multiple syllables, the boundaries between them follow a predictable pattern: onsets are “greedy.” They grab as many consonants as the language’s rules allow. In the word “extra,” for instance, the syllable break falls where it does because the second syllable’s onset claims the consonants it can legally start with.

How Sounds Change in Connected Speech

When people speak naturally, sounds don’t stay perfectly fixed. They shift, blend, and sometimes disappear entirely. Phonology describes these changes through a set of processes that operate in every language.

Assimilation happens when nearby sounds influence each other and become more alike. If you say “ten bikes” at a normal pace, the “n” at the end of “ten” often shifts toward an “m” sound because your mouth is already preparing for the “b.” Elision is the outright dropping of sounds, especially in fast speech. The word “can” in the sentence “I can go” frequently loses its vowel, shrinking from its full pronunciation to something closer to “kn.” Intrusion is the opposite: a sound gets inserted to smooth the transition between two others, like the faint “r” some British English speakers add between “idea” and “of.”

These aren’t mistakes or lazy speech. They’re systematic, rule-governed processes that phonologists can predict based on the sounds involved and their positions.

Stress, Pitch, and Intonation

Phonology doesn’t stop at individual sounds. It also covers features that stretch across syllables and sentences, known as suprasegmentals or prosody. The three main ingredients are pitch, loudness, and length.

In English, words longer than one syllable alternate between stressed and unstressed syllables. A stressed syllable is louder, longer, and higher in pitch than the syllables around it. Shifting stress can change a word’s meaning or grammatical role: “REcord” (noun) versus “reCORD” (verb).

Some languages take pitch even further. In tonal languages like Mandarin, the pitch pattern on a single syllable changes the word’s meaning entirely. When pitch is used this way, it’s called tone. English doesn’t use tone at the word level, but it does use intonation, which is pitch variation across a sentence to signal things like questions, surprise, or sarcasm. “You’re leaving” can be a flat statement or a shocked question depending entirely on how the pitch moves at the end.

Distinctive Features

Linguists needed a way to describe exactly how phonemes differ from one another, so in the 1950s and 1960s, researchers developed a system of distinctive features. Rather than treating each sound as a unique, indivisible unit, this approach breaks sounds down into a set of binary properties: each feature is either present or absent. A sound is either voiced (vocal cords vibrating) or voiceless, either nasal (air flowing through the nose) or not.

This system explains why certain sounds tend to behave as groups. Sounds that share features often undergo the same phonological processes. It also explains minimal contrasts efficiently. The only thing separating “b” from “p” is voicing: they’re identical in every other feature. Distinctive features give phonologists a precise vocabulary for writing rules about how sounds pattern and change.

Phonology and Learning to Read

Phonological awareness, the ability to recognize and manipulate the sound structure of words, is one of the strongest predictors of early reading success. This includes skills like breaking words into syllables, identifying the first sound in a word, and blending individual sounds back into whole words.

Research consistently shows that children with reading difficulties score significantly lower on phonological awareness tasks compared to peers without reading difficulties. This holds true across specific skills like phoneme segmentation (breaking “dog” into d-o-g) and initial phoneme identification (recognizing that “fish” starts with /f/). The connection makes sense: alphabetic writing systems map letters to sounds, so a child who struggles to hear the individual sounds in speech will struggle to connect those sounds to letters on a page.

Phonological Disorders

Speech sound disorders in children fall into two broad categories, and phonology is key to understanding the difference. An articulation disorder involves difficulty physically producing a particular sound, often due to a structural or motor issue with the mouth, tongue, or jaw. A phonological disorder, by contrast, is a problem with the sound system itself. The child may be physically capable of producing a sound but uses it in the wrong places, substitutes it with predictable patterns, or fails to recognize how it contrasts with other sounds.

Children with phonological disorders tend to show consistent, pattern-based errors. They might replace all sounds made at the back of the mouth with sounds made at the front, saying “tat” for “cat” and “doat” for “goat.” This isn’t a muscle problem; it’s a problem organizing the rules of the sound system. Some children show inconsistent errors instead, producing the same word differently each time, which points to difficulty planning whole-word sound sequences. Speech-language pathologists evaluate the physical speech mechanism alongside the child’s error patterns to determine whether the root cause is structural, neurological, or phonological.