What Is Phonetics? Definition, Branches, and Examples

Phonetics is the study of speech sounds: how your body produces them, how they travel through the air, and how your ears perceive them. It covers every sound the human voice can make, regardless of language, and uses a universal system of symbols to write those sounds down precisely. If you’ve ever wondered why “phone” and “fone” sound identical despite different spellings, or why a “k” sounds slightly different in “keep” versus “cool,” phonetics has the answers.

The Three Branches of Phonetics

Phonetics splits into three areas, each focused on a different stage of the journey a speech sound takes from speaker to listener.

Articulatory phonetics looks at how your body physically creates sounds. It’s concerned with the positions of your tongue, lips, teeth, and vocal cords during speech. This is the branch most people encounter first because it’s the most hands-on: you can feel a sound being made.

Acoustic phonetics studies the sound waves that travel through the air after you speak. Every speech sound creates pressure changes in the air that can be measured by frequency (pitch) and amplitude (loudness). Frequency is counted in hertz, the number of pressure cycles per second. A higher frequency generally means a higher-pitched sound. Amplitude measures how large those pressure waves are, which your ear interprets as volume. Researchers use a visual tool called a spectrum, a graph plotting frequency against amplitude, to see the “fingerprint” of any speech sound.

Auditory phonetics focuses on the listener’s end, examining how the ear and brain decode those sound waves back into meaningful speech.

How Your Body Makes Speech Sounds

Your vocal tract is essentially a tube running from your lungs up through your throat and out your mouth and nose. Along that tube, several structures shape the air into distinct sounds. The vocal cords (two small folds of tissue in your throat) vibrate to create voice. The space between them, called the glottis, opens and closes rapidly during vibration. Above that, your tongue, teeth, the bony ridge behind your upper teeth (the alveolar ridge), the roof of your mouth, your soft palate (velum), and your lips all work together to redirect, block, or squeeze the airflow in different ways.

Every consonant you produce can be described by three things: where in the mouth you create the constriction, how much you constrict the airflow, and whether your vocal cords vibrate while you do it.

Voiced and Voiceless Sounds

Place your fingers on your throat and say “buh.” You’ll feel a vibration. Now say “puh.” Same lip position, but the vibration is gone. That vibration is voicing, and it’s the only difference between many pairs of English consonants. Your lips, tongue, and jaw do the same work for both sounds in each pair. The vocal cords either join in or stay silent.

Here are some common voiced/voiceless pairs with example words:

/b/ and /p/: “back” vs. “pencil”
/d/ and /t/: “day” vs. “two”
/g/ and /k/: “gold” vs. “candy”
/v/ and /f/: “very” vs. “find”
/z/ and /s/: “zone” vs. “see”

This distinction matters in everyday speech. Swap just the voicing and you change the word entirely: “save” becomes “safe,” “wide” becomes “white,” “prize” becomes “price.”

Place and Manner of Articulation

Beyond voicing, consonants differ in where and how the airflow is blocked. “Place of articulation” refers to the spot in the mouth where the constriction happens. The /p/ and /b/ sounds are made at the lips (bilabial). The /t/ and /d/ sounds are made with the tongue pressed against the alveolar ridge just behind the upper teeth (alveolar). The “h” at the start of “how” is made down at the glottis itself.

“Manner of articulation” describes how completely the air is blocked. Say “t” and try to hold it. You can’t, because your tongue completely stops the airflow for an instant. That’s called a stop. Now say “s” and hold it. You can sustain it for as long as you have breath, because the tongue only partially blocks the air, letting it hiss through a narrow gap. That’s a fricative. Both /t/ and /s/ are made at roughly the same spot (the alveolar ridge), but the degree of constriction is what makes them sound completely different.

How Vowels Are Classified

Vowels are more open than consonants. There’s no blockage of airflow, so linguists classify them differently, using four features: how high the tongue sits in the mouth, how far forward or back it is, whether the lips are rounded, and how tense the tongue muscles are.

You can feel this yourself. Say the vowels in “beet,” “bit,” “bait,” “bet,” and “bat” slowly, one after another. Your tongue starts high and at the front of your mouth for “beet” and gradually drops lower for each word. Now try “boot” and “boat.” Your tongue is high and at the back for “boot,” then drops for “boat.” The lowest back vowel is the “ah” sound in “father.”

Diphthongs: Two Vowels in One

A diphthong is a vowel sound that starts in one position and glides into another within a single syllable. Unlike a pure vowel where your mouth stays relatively still, a diphthong requires your tongue and lips to move while you’re producing it.

English has several common diphthongs. The vowel in “pay” starts with an “eh” sound and glides toward “ee.” The vowel in “boy” starts with an “aw” and glides toward “ee.” The vowel in “how” starts with an “ah” and glides toward “oo.” You can feel the movement if you say these words slowly and pay attention to how your mouth changes shape mid-vowel.

The International Phonetic Alphabet

English spelling is notoriously unreliable as a guide to pronunciation. The “f” sound can be spelled “f” (find), “ph” (phone), or “gh” (enough). The letter “a” sounds different in “father,” “fate,” and “fat.” To solve this problem, linguists use the International Phonetic Alphabet (IPA), a standardized set of symbols where each symbol represents exactly one sound, and each sound gets exactly one symbol.

Some IPA symbols look familiar. Stops like [p], [b], [t], [d], and [k] use the same letters you’d expect. The word “peach” is transcribed with [p] at the start, “apple” has [p] in the middle, and “cap” ends with [p]. Fricatives work similarly: [f] covers the “f” sound whether it’s spelled “phone,” “raffle,” or “leaf.”

Other symbols are less intuitive. The “sh” sound in “she” is written [ʃ]. The “zh” sound in “measure” is [ʒ]. The “th” in “thin” is [θ], while the “th” in “the” is [ð]. These symbols let linguists, speech therapists, and language teachers write down pronunciation without any ambiguity.

Phonetics vs. Phonology

People often confuse phonetics with phonology, but they ask different questions. Phonetics studies the physical properties of speech sounds, often without even knowing what language is being spoken. Phonology studies the patterns and rules those sounds follow within a specific language.

Here’s a concrete example. The /k/ sound in “keep” is produced slightly further forward in the mouth than the /k/ in “cool,” which is made further back with rounded lips. A native English speaker never notices this difference because both sounds function as the same unit of meaning. Phonetics cares about the physical difference between these two sounds. Phonology cares that English speakers treat them as the same sound, grouping them as variants (called allophones) of a single category. The vowels in “cool,” “whose,” and “moon” are similarly all slightly different in physical production but function as variants of the same vowel in English.

Where Phonetics Is Used in Practice

Phonetics has hands-on applications well beyond the linguistics classroom. Speech-language pathologists rely on phonetic training to diagnose and treat speech sound disorders, voice disorders, stuttering, and hearing-related speech difficulties. They use IPA transcription, sometimes with specialized extensions designed for disordered speech, to document exactly what a patient is producing and track improvement over time. Instrumental tools like spectrograms give clinicians a visual readout of a patient’s speech, revealing problems that the ear alone might miss.

In forensic science, phonetic analysis helps identify speakers from audio recordings, comparing voice characteristics like pitch patterns and the acoustic fingerprints of specific sounds. Language teachers use phonetics to help students learn the sounds of a new language, pinpointing exactly where and how to position the tongue for unfamiliar consonants or vowels. And in technology, the acoustic models behind voice assistants and speech recognition software are built on the same principles of frequency, amplitude, and spectral analysis that define acoustic phonetics.