Babies don’t have a formal language with grammar and words, but they absolutely have a structured system of communication that develops in predictable stages. From birth, infants use cries, coos, facial expressions, and body movements to signal their needs, and these sounds are far from random. By the time a baby starts babbling around 6 months, they’re already practicing the building blocks of whatever language they’ll eventually speak.
What Infant Sounds Actually Mean
Newborn cries carry real acoustic information that distinguishes one need from another. Pain cries and colic cries have a significantly higher pitch than hunger or fussy cries. Colic cries are the most intense of all, outranking even pain cries in energy, length, and pitch. Hunger and fussy cries, by contrast, are lower intensity and shorter. Parents who spend time with their baby often learn to hear these differences intuitively, even if they can’t explain exactly what’s different.
One popular theory, called Dunstan Baby Language, proposes that all infants worldwide produce five specific pre-cry sounds in their first three months: “neh” (hungry), “owh” (sleepy), “eh” (needs to burp), “eairh” (stomach pain), and “heh” (uncomfortable). The idea is that these sounds are reflexive, triggered by physical sensations before a full cry begins. While several researchers have used these categories to build cry-classification systems, the theory hasn’t been rigorously validated in large-scale peer-reviewed studies. It’s a useful framework for new parents, but it’s not settled science.
How Vocal Development Unfolds
Babies move through a remarkably consistent sequence of vocal stages in their first year, and each one represents a leap in their ability to communicate.
From birth to about 3 months, babies coo and make pleasure sounds. These early vocalizations are mostly vowel-like (“oooh,” “aaah”) and tend to happen when the baby is content. They’re not trying to say anything specific, but they’re already learning that making sounds gets a response.
Between 4 and 6 months, babbling begins. Babies start combining consonants and vowels, experimenting with sounds that begin with p, b, and m. This stage sounds more speech-like, and babies often seem to be “talking” even though they’re not forming words. They’re mapping out which mouth movements produce which sounds.
From 7 months to a year, babbling becomes more complex. Babies string together long and short groups of repeated syllables like “tata,” “upup,” or “bibibi.” This is called canonical babbling, and it’s a critical milestone. The rhythm and melody of these strings start to mirror the language the baby hears most. A baby raised in a Mandarin-speaking home begins to sound different from one raised in an English-speaking home, even though neither is saying real words yet. By their first birthday, most children can call a parent “mama” or “dada” or use another special name with intention.
Communication Before Words
Sounds are only part of how babies “talk.” From very early on, they use their face, hands, and body to get messages across. Babies who are tired often rub their face, tug on their ears, or ball their hands into fists. Hungry babies root, turning toward anything that touches their face and trying to latch onto fingers, hands, or clothing. These aren’t random movements. They’re hardwired reflexes that communicate specific physical states.
Around 10 months, something more sophisticated emerges: intentional gestures. Researchers describe two types. The first is what’s called a proto-imperative gesture, where a baby points at a toy because they want you to get it for them. They’re using you as a tool to obtain something. The second is a proto-declarative gesture, where a baby points at a dog walking by because they want you to look at it too. They’re using an object to get your attention and share an experience. That second type is especially significant because it shows the baby understands that other people have their own focus of attention, a cognitive leap that’s foundational for language.
Why Your Response Shapes Their Language
The way adults talk to babies has a measurable effect on how quickly language develops. Infants who hear more speech directed specifically at them (not just overheard adult conversation) become faster at processing familiar words in real time and develop larger vocabularies by age 2. In one study, the amount of child-directed speech a baby heard at 19 months strongly predicted their vocabulary size at 24 months, with a correlation of 0.57. Even after accounting for how much the babies themselves were vocalizing, the relationship held strong at 0.51.
The key ingredient isn’t just volume of words. It’s the back-and-forth quality. When you respond to a baby’s coo with a smile and a sentence, and the baby coos back, you’re having what researchers call a proto-conversation. Brain imaging studies show that during these face-to-face social interactions, areas of the infant brain involved in attention light up significantly more than during non-social situations. Remarkably, this neural activity during social exchanges predicts language ability more than two years later, well into the child’s third year of life. The baby’s brain is already wiring itself for language long before the first word arrives.
This is why “parentese,” that high-pitched, sing-song way adults instinctively talk to babies, works so well. It’s not baby talk in the sense of made-up words. It’s real language delivered with exaggerated pitch, slower pace, and clear enunciation. Babies pay more attention to it, process it more efficiently, and learn from it faster than from normal adult speech.
Are Baby Sounds Universal Across Cultures?
The earliest sounds babies make are largely universal. A newborn’s hunger cry sounds similar whether the baby is born in Tokyo, Lagos, or São Paulo, because those early vocalizations are driven by reflexes and anatomy rather than learned behavior. Cooing and early babbling also follow the same timeline across cultures.
But cultural differences appear surprisingly early. Research comparing infants from urban Germany and rural Ecuador found that by 3 months, German babies were already showing higher-intensity positive expressions during social interaction, likely reflecting the more expressive, face-to-face parenting style common in that cultural setting. By 4.5 months, the gap narrowed as babies from both backgrounds responded similarly to engaging stimulation. This suggests that while the biological machinery of infant communication is universal, the social environment starts shaping how babies express themselves within the first few months of life.
By the second half of the first year, babbling increasingly takes on the sound patterns of the baby’s native language. The consonants they favor, the rhythms they practice, and the melodies of their babble all begin reflecting what they hear around them. So babies start with a shared “language” of cries and coos, then gradually tune their communication system to match the specific world they’re growing up in.

