How Your Voice Actually Works: The Complete Science of Speaking
By Deepr Team
What happens in the half-second between thinking a word and saying it?
Your voice is produced by three systems working in sequence: your lungs push air out, that air vibrates two small flaps of tissue in your throat, and those vibrations get reshaped as they travel through your mouth and nose. The whole process takes about 50 milliseconds.
But here's what surprised me when I started researching this: your vocal cords aren't cords at all. They're folds of tissue with five distinct layers, each with a specific job. And the physics involved would make a mechanical engineer jealous - we're talking about tissue vibrating over 100 times per second, controlled by muscles you've never consciously thought about.
I've spent months reading research papers, talking to voice coaches, and experimenting with my own voice. What I've learned has changed how I think about speaking entirely. If you want to understand why your voice sounds the way it does - and whether you can change it - you need to understand the machinery first.
The three systems that create your voice
Voice production isn't one thing. It's three separate systems working together, each doing something completely different. The Voice Foundation breaks it down this way:
System 1: The power source (your breath)
Everything starts with air. Your lungs, diaphragm, rib cage, and chest muscles form what voice scientists call the "respiratory system" or breath support. Think of this as the generator.
When you exhale to speak, you're not just pushing air out. You're creating a controlled, steady stream of pressure that will power your vocal folds. The key word is "controlled." Shouting doesn't mean pushing harder - it means managing that airflow differently.
According to research from University of Mississippi Medical Center, diaphragmatic breathing - using your belly rather than your chest - produces a more powerful voice with less strain. This is because using the back and bottom of your lungs allows for more air, and therefore smoother airflow through your vocal folds.
A tense diaphragm can make your voice sound breathy, shaky, or uneven. Signs include shallow breathing, discomfort in your chest, and a weaker-than-usual voice.
System 2: The vibrator (your larynx)
This is where sound actually gets made. Your larynx - commonly called the voice box - sits in your throat and houses your vocal folds (also called vocal cords, though that term is falling out of favor because they're not cord-like at all).
The larynx is a surprisingly complex structure. According to the NCBI Bookshelf, it consists of a cartilaginous skeleton (thyroid, cricoid, and arytenoid cartilages), intrinsic muscles, extrinsic muscles, nerves, and a mucosal lining.
When you breathe normally, your vocal folds stay open to let air pass. When you want to speak, they close almost completely, and the exhaled air pushes through the tiny gap between them. This creates vibration.
Here's the counterintuitive part: your vocal folds aren't plucked like guitar strings. They're set into motion by aerodynamic forces. As air rushes through the narrow opening, it creates a suction effect (the Bernoulli effect) that pulls the folds back together. Then pressure builds up again, pushes them apart, and the cycle repeats. This happens incredibly fast.
System 3: The shaper (your vocal tract)
The raw sound produced by your vocal folds is just a buzz. It sounds nothing like your voice. That buzzy sound gets transformed as it travels through your vocal tract - your throat, mouth, nasal passages, and sinuses.
According to Toronto Speech Therapy, these structures act as resonating chambers. When sound waves pass through them, they get amplified at certain frequencies and dampened at others. This filtering is what gives your voice its unique character.
Your tongue, soft palate, and lips then shape this resonated sound into recognizable words. The whole system works together to turn a simple buzz into the complex sounds of human speech.
What are vocal folds actually made of?
This is where things get fascinating. Your vocal folds aren't simple flaps of tissue. According to Medscape, they have five distinct layers, each with a specific composition and function.
Layer 1: The epithelium (outer skin)
The outermost layer is a thin covering of squamous epithelium - essentially skin. It's about 0.05mm thick in the middle of the vocal fold. This layer maintains the shape of your folds, protects underlying tissue, and helps regulate hydration.
On the surface of these epithelial cells are tiny structures called microridges and microvilli. Their job? Spreading and retaining a mucous coat. Proper lubrication is essential for your vocal folds to vibrate smoothly - without it, you get excessive friction and irritation.
Layer 2-4: The lamina propria (the wobble zone)
Just beneath the epithelium is the lamina propria, and it's divided into three sublayers:
Superficial layer (Reinke's space): Voice scientists describe this as having the consistency of "soft gelatin." It's loose, pliable, and rich in substances like hyaluronic acid that regulate water content. This layer provides the cushion that allows your folds to vibrate freely. It makes up about 13% of the lamina propria's thickness.
Intermediate layer: This has the consistency of "a bundle of soft rubber bands." It's mostly elastic fibers that can stretch and snap back. At about 51% of the lamina propria, it's the thickest sublayer.
Deep layer: Described as a "bundle of cotton thread," this layer contains collagenous fibers that provide durability. It prevents your folds from stretching too far out of shape. The intermediate and deep layers together form what's called the vocal ligament.
Layer 5: The vocalis muscle (the engine)
At the very bottom sits skeletal muscle - the thyroarytenoid or vocalis muscle. This is the body of your vocal folds and gives them most of their bulk.
Here's something I didn't know until recently: research published in Hormones journal shows that the infant lamina propria is only composed of a single layer. There's no vocal ligament at birth. It starts appearing around age 4, the three defined layers develop between ages 6-12, and they're fully mature by the end of adolescence. Your voice is literally still forming through childhood.
How fast do vocal folds actually vibrate?
Faster than you'd guess. According to The Voice Foundation, vocal folds vibrate in excess of 100 times per second during voicing, and often much faster.
A population study of 2,472 participants found specific numbers:
- Men's vocal folds vibrate between 90-500 Hz, averaging about 115 Hz in normal conversation
- Women's vocal folds vibrate between 150-1000 Hz, averaging about 200 Hz in conversation
At soft speaking levels, the study found mean frequencies of 111.8 Hz for males and 161.3 Hz for females. At louder levels, these increased to 175.5 Hz for males and 246.2 Hz for females.
What controls this vibration rate? Two main factors: the tension in your vocal folds and their mass. Stretch your folds thinner and tighter, and they vibrate faster (higher pitch). Let them thicken and relax, and they vibrate slower (lower pitch). Your laryngeal muscles control these adjustments constantly as you speak.
Why is your voice different from everyone else's?
Part of it is anatomy. Adult male vocal folds are between 17mm and 25mm long. Female vocal folds are between 12.5mm and 17.5mm. Longer, thicker folds vibrate more slowly, producing a lower pitch.
But that's only the beginning. Your vocal tract - the shape and size of your throat, mouth, and nasal passages - acts like a unique acoustic fingerprint. When sound passes through these chambers, certain frequencies get amplified (these are called formants) while others get dampened.
According to research on resonance, the pharynx (the muscular tube from the back of your nose to your voice box) is the most important resonator by virtue of its position, size, and adjustability. The oral cavity is second, with the shape controlled by your tongue, jaw opening, and lips.
Even subtle differences in these structures create noticeable variations in voice quality. That's why identical twins can have distinguishable voices despite nearly identical genetics.
Why do voices crack - and how can you prevent it?
A voice crack is a momentary disruption in the muscular control of your vocal folds. According to Biology Insights, it happens when your folds suddenly stretch, shorten, or tighten beyond their intended coordination. The abrupt change in vibratory pattern causes a sudden pitch shift or temporary loss of voice.
The puberty factor
Testosterone causes the larynx to enlarge and vocal folds to grow longer and thicker. Research shows that during puberty, male voice pitch drops by about one octave. Adolescent boys' voices typically begin to mutate at 12-13 years and stabilize between 15-18.
The problem? Your brain and vocal muscles need time to adapt to these new dimensions. The neural pathways that controlled your childhood voice need to recalibrate for a completely different instrument. Until they do, pitch instability is common.
Female pubertal changes are less dramatic - pitch drops by only 3-4 semitones - but can include increased breathiness, occasional cracking, and pitch inaccuracy while singing.
Other common causes
Dehydration: Your vocal folds need moisture to vibrate smoothly. When they're dry, their movement becomes erratic. This is why singers obsess over hydration.
Emotional states: Nervousness, anxiety, and embarrassment tense your throat muscles. This restricts vocal fold movement and makes them prone to sudden, uncontrolled pitch changes.
Vocal fatigue: Prolonged speaking or singing can exhaust the tiny muscles controlling your folds, leading to temporary loss of control.
Prevention strategies
Based on Healthline's research:
- Drink at least 64 ounces of water daily. Room temperature is better than cold before speaking, as cold water can limit laryngeal muscle movement.
- Warm up your voice with exercises before extended speaking or singing.
- Avoid shouting or whispering excessively - both strain your vocal cords.
- Don't change pitch or volume too quickly. Modulate gradually.
- Manage stress through relaxation techniques like deep breathing.
- Allow your voice adequate rest after heavy use.
How do hormones change your voice throughout life?
Your voice isn't fixed. Hormones reshape it across your entire lifespan.
Testosterone's role
Research published in Hormones found that for the first 20 years of life, vocal folds grow at approximately 0.7mm per year in males and 0.4mm per year in females. This results in maximum adult lengths of about 16mm for men and 10mm for women.
Male testosterone levels in puberty are 20-30 times higher than in females of the same age. This leads to increased bulk of laryngeal muscles and ligaments, plus the secondary descent of the larynx - a male-specific change that elongates the vocal tract.
The aging voice
Here's something unexpected: men and women experience opposite vocal changes as they age.
In men over 65, testosterone levels have been declining by about 1% per year since age 30. This contributes to muscle wasting that also affects the vocalis muscle. The result? Vocal fold bowing, reduced glottal contact, increased breathiness, and a higher-pitched voice. Men's voices actually become more feminine-sounding with age.
Women experience the reverse. Post-menopausal hormonal changes often result in a lower speaking pitch.
So while young men have deeper voices than young women, elderly populations show less vocal difference between the sexes.
How does voice affect how others perceive you?
This is where voice science meets psychology, and the research is striking.
Deep voices and authority
According to a systematic review in Frontiers in Psychology, deeper voices are often perceived as more authoritative and trustworthy across various cultural contexts. Our evolutionary ancestors likely associated low-frequency tones with larger physical stature and dominance.
There's a bias toward deeper voices for leadership positions, regardless of the speaker's gender. Biologically, deeper voices signal higher testosterone levels, which correlates with perceived dominance and confidence.
But it's complicated
Here's where it gets interesting. Research on financial trust found that when deciding who to trust with money, people actually prefer higher-pitched voices. While lower pitch signals power, higher pitch may convey warmth and cooperativeness - qualities that matter when considering reciprocity.
A study from UC Berkeley found that "vocal warmth" (a soft, rounded quality) increased listener buy-in by 20% compared to neutral delivery.
Professionals adjust automatically
Research found that professors spoke in deeper, more resonant voices when answering questions requiring specialist knowledge - unconsciously projecting authority. Women lowered their pitch more than men when giving career advice versus casual directions.
We all do this to some degree. Your voice isn't static - you naturally adjust it based on context, often without realizing it.
Can you actually train your voice to change?
Yes, but with caveats.
A scoping review of voice training research found that the most effective approach for healthy individuals was Semi-Occluded Vocal Tract Exercises (SOVTE), particularly straw phonation. Positive effects were observed on both acoustic measurements and self-assessed vocal quality.
However, the review found optimal exercise durations of 5-7 minutes for men and 3-5 minutes for women. Prolonging exercises beyond this actually caused negative effects on voice quality and increased vocal discomfort.
Research on music students tracked 23 students over 18 months of training. They found significant improvement in the Dysphonia Severity Index and vocal range expansion. Voice training works - it just takes time and proper technique.
For teachers, long-term studies showed that trained individuals maintained better vocal health nearly two years after completing training programs, while untrained comparison groups showed vocal decline.
The key takeaways
- Your voice is a three-part system: breath provides power, vocal folds create vibration, and your vocal tract shapes the final sound.
- Vocal folds have five layers: Each with different mechanical properties, all developing through childhood and adolescence.
- Hormones reshape your voice across your lifespan: Men's voices deepen dramatically at puberty but rise again in old age. Women experience the opposite pattern.
- Voice cracks happen when neural control can't keep up with physical changes: They're preventable through hydration, warm-ups, and avoiding strain.
- How you sound affects how others perceive you: Deep voices convey authority, but warm voices build trust. Context matters.
- Voice training does work: But proper technique matters, and more isn't always better.
Understanding the machinery behind your voice is the first step toward using it more effectively. Whether you want to sound more confident, prevent vocal fatigue, or just satisfy curiosity about one of your most powerful communication tools, the science is clear: your voice is more trainable than you probably thought.
Apps like Deepr can help you track changes in your voice metrics over time - measuring depth, clarity, and confidence as you practice. But the real work happens when you understand what you're actually training.