A long chain of amino acids creates a protein. More specifically, when amino acids link together into a chain of roughly 50 or more, the result is a polypeptide, and one or more polypeptides fold into a functional protein. Most proteins in the human body are between 50 and 2,000 amino acids long, and the human genome codes for approximately 20,000 different ones. These molecules handle nearly every job your body needs done, from digesting food to building muscle to carrying oxygen in your blood.
How Amino Acids Link Together
Amino acids connect through a reaction called condensation (also known as dehydration synthesis). When two amino acids join, each bond releases one molecule of water. The resulting link is called a peptide bond, a strong covalent connection that holds the chain together. A short chain of 2 to 50 amino acids is called a peptide. Once the chain grows beyond about 50 amino acids, it’s classified as a polypeptide, which is the backbone of a protein.
Your body uses 20 standard amino acids to build proteins. Nine of these are essential, meaning your body cannot make them and you must get them from food: histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine. Another five can be made internally, and six more are conditionally essential, meaning the body can produce them but sometimes not fast enough to meet demand.
How Your Body Builds the Chain
Protein assembly happens on structures called ribosomes, found in the fluid inside your cells. The process starts with a molecule of messenger RNA (mRNA), which carries instructions copied from your DNA. The ribosome reads the mRNA sequence three letters at a time. Each three-letter group, called a codon, specifies one particular amino acid or signals the chain to stop growing. Translation always begins with the codon AUG, which also codes for the first amino acid in the chain.
The amino acids themselves don’t recognize the mRNA directly. Instead, small adapter molecules called transfer RNAs (tRNAs) do the matchmaking. One end of each tRNA carries a three-letter sequence that pairs with the codon on the mRNA. The other end carries the corresponding amino acid. As the ribosome moves along the mRNA, tRNAs deliver amino acids one by one, and the ribosome catalyzes the peptide bond that adds each new amino acid to the growing chain. The result is a precise sequence that can be hundreds or thousands of amino acids long.
From Chain to Three-Dimensional Shape
A flat chain of amino acids isn’t useful on its own. Proteins only work once they fold into a specific three-dimensional shape. This folding happens in four levels, each building on the last.
Primary structure is simply the sequence of amino acids in the chain, including any strong bonds that link different parts of the chain together. Think of it as the spelling of the protein.
Secondary structure forms when sections of the chain coil into spirals or fold into flat sheets. These shapes arise because of hydrogen bonds forming between the repeating backbone units of the chain. Two common patterns are the alpha helix (a corkscrew shape) and the beta sheet (a flat, pleated ribbon).
Tertiary structure is the overall three-dimensional shape of a single polypeptide. It results from interactions between the side groups of amino acids along the chain. Some side groups are water-repelling and get pushed to the protein’s interior, while others are attracted to water and stay on the surface. Additional forces, including hydrogen bonds and electrostatic interactions, pull different parts of the chain closer together until it collapses into a compact shape. Proteins often contain modular sections called domains, typically 40 to 350 amino acids each, that fold independently and perform specific functions.
Quaternary structure exists when two or more folded polypeptide chains join together into a larger complex. Insulin, for example, is a small hormone made of two separate chains held together by strong sulfur-to-sulfur bonds.
Chaperones: Folding Assistants
Folding correctly is not easy, especially for larger proteins. Long chains can stick to themselves or clump with other proteins before they finish folding. To prevent this, cells use helper proteins called molecular chaperones. First discovered as proteins that cells produce in response to heat stress, chaperones work by physically shielding the vulnerable, unfolded chain from its surroundings.
One major group of chaperones forms a barrel-shaped cage. An unfolded protein enters the cage, where it can fold in isolation without bumping into other molecules that might cause clumping. Another group recognizes short water-repelling stretches on unfolded proteins and binds to them temporarily, giving the chain time to fold step by step. Without chaperones, many proteins would aggregate into useless clumps rather than reaching their functional shape.
What Proteins Actually Do
The roughly 20,000 protein-coding genes in the human genome produce a staggering variety of molecules. Alternative ways of reading the same gene can expand the total number of distinct protein forms well beyond that count. These proteins fall into several broad categories based on function.
- Enzymes speed up nearly all of the thousands of chemical reactions happening inside your cells. They break down food, copy DNA, and build new molecules.
- Structural proteins give cells their shape and allow your body to move. Collagen provides strength to skin and tendons, while actin helps muscles contract.
- Transport and storage proteins carry atoms and small molecules where they’re needed. Ferritin, for instance, stores iron inside cells, while hemoglobin shuttles oxygen through your bloodstream.
- Signaling proteins like hormones carry messages between cells. Insulin, the two-chain protein mentioned earlier, tells your cells to absorb sugar from the blood.
- Immune proteins such as antibodies recognize and neutralize bacteria, viruses, and other invaders.
What Happens When Folding Goes Wrong
Because a protein’s function depends entirely on its shape, misfolding can have serious consequences. When amino acid chains fold incorrectly, they can stick together into dense, fibrous clumps called amyloid plaques. These aggregates accumulate in tissues and interfere with normal cell function.
Alzheimer’s disease is the most well-known example. It involves the buildup of misfolded amyloid-beta protein fragments and tangled tau proteins in the brain, leading to memory loss and cognitive decline. Parkinson’s disease involves clumps of a misfolded protein called alpha-synuclein accumulating in brain cells. Huntington’s disease results from an abnormally long repeating segment within the huntingtin protein that causes it to misfold and aggregate.
Misfolding diseases aren’t limited to the brain. Type 2 diabetes involves amyloid deposits of a small protein called amylin in the insulin-producing cells of the pancreas. Cataracts involve misfolded crystallin proteins in the lens of the eye. Even certain thyroid cancers are associated with amyloid formation from the hormone calcitonin. In each case, the fundamental problem is the same: a chain of amino acids that fails to reach or maintain its correct three-dimensional shape, turning a vital molecule into a source of damage.

