Sound Is Perception, Not Just Physics
Sound begins as a physical phenomenon — pressure waves moving through a medium like air. But what we hear is a radically transformed version of that physical event, shaped by the anatomy of the ear, the processing power of the brain, and a lifetime of learned acoustic context. Psychoacoustics is the scientific study of this relationship between physical sound and perceived sound.
For audio engineers, producers, and equipment designers, psychoacoustics isn't an academic curiosity — it's the foundation of practical decisions about compression, EQ, spatial processing, and audio codec design.
The Anatomy of Hearing: A Brief Overview
Sound waves enter the outer ear (pinna), travel down the ear canal, and vibrate the eardrum. This mechanical vibration passes through three tiny bones in the middle ear (the ossicles) to the cochlea — a fluid-filled, spiral structure in the inner ear. The cochlea is lined with thousands of tiny hair cells tuned to different frequencies. These hair cells convert mechanical vibration into electrical nerve signals, which travel to the auditory cortex of the brain for interpretation.
Critically, the cochlea functions as a biological frequency analyzer. Different regions respond to different frequencies, and it is this spatial separation that forms the basis of our ability to distinguish pitch.
Key Psychoacoustic Phenomena
The Fletcher-Munson Curve (Equal Loudness Contours)
Human hearing is not equally sensitive at all frequencies. At low volumes, we perceive bass and treble as quieter than midrange frequencies, even when measured at equal sound pressure levels. The Fletcher-Munson curves (since refined and standardized as the ISO 226 equal-loudness contours) map this sensitivity across frequencies and listening levels.
Practical implication: Mixes made at high volume tend to sound thin when played back quietly. At loud levels the ear perceives bass and treble relatively strongly, so the engineer adds less low end than the mix actually needs; at quiet levels that bass falls away. Most experienced engineers mix at moderate listening levels (around 75–85 dB SPL) to avoid this trap. The "loudness" button on older stereos boosted bass and treble to compensate for the same effect at quiet listening levels.
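As a rough illustration of this frequency-dependent sensitivity, the A-weighting curve (IEC 61672), which descends from early equal-loudness research, can be computed in closed form. This is a sketch of the shape of the effect, not a substitute for the full ISO 226 contours:

```python
import math

def a_weighting_db(f: float) -> float:
    """Approximate A-weighting gain in dB at frequency f (Hz).

    A-weighting derives from the 40-phon equal-loudness contour; it shows
    how far the ear's sensitivity at f falls relative to ~1 kHz.
    Constants are the standard IEC 61672 pole frequencies."""
    f2 = f * f
    ra = (12194.0**2 * f2 * f2) / (
        (f2 + 20.6**2)
        * math.sqrt((f2 + 107.7**2) * (f2 + 737.9**2))
        * (f2 + 12194.0**2)
    )
    # +2.00 dB normalizes the curve to 0 dB at 1 kHz
    return 20.0 * math.log10(ra) + 2.00

for f in (50, 100, 1000, 4000, 16000):
    print(f"{f:>6} Hz: {a_weighting_db(f):+6.1f} dB")
```

Note how 50 Hz sits roughly 30 dB below the 1 kHz reference: at quiet levels, bass needs far more physical energy to sound equally loud.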
Auditory Masking
When two sounds occur simultaneously, a louder sound can make a quieter sound at a nearby frequency inaudible — this is called frequency masking. A similar effect occurs in time: a loud sound can mask a quieter sound that occurs just before or just after it (temporal masking).
Practical implication: This principle is the entire basis of lossy audio compression formats like MP3 and AAC. These codecs build a psychoacoustic model of the signal, identify audio data that falls below the predicted masking threshold (information the listener cannot perceive), and discard it, dramatically reducing file size. The quality of a codec depends largely on how accurately it models human auditory masking.
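The codec decision can be caricatured in a few lines. Every number here (the 20 dB offset, the 0.05 dB/Hz skirt slope, the test tones) is an illustrative assumption; real encoders use Bark-scale spreading functions and much more careful threshold models:

```python
import numpy as np

# Toy sketch of frequency masking, not a real codec model: a strong tone
# raises the audibility threshold for spectrally nearby content.
SR, N = 48_000, 2048
t = np.arange(N) / SR
signal = (1.0  * np.sin(2 * np.pi * 1000 * t)     # loud 1 kHz masker
          + 0.01 * np.sin(2 * np.pi * 1100 * t)   # quiet tone near the masker
          + 0.01 * np.sin(2 * np.pi * 8000 * t))  # quiet tone far away
spectrum_db = 20 * np.log10(np.abs(np.fft.rfft(signal * np.hanning(N))) + 1e-12)
freqs = np.fft.rfftfreq(N, 1 / SR)

# Crude masking skirt: 20 dB below the masker peak, decaying 0.05 dB
# per Hz of distance (assumed values; real models work in Bark bands).
peak = np.argmax(spectrum_db)
mask_threshold = spectrum_db[peak] - 20.0 - 0.05 * np.abs(freqs - freqs[peak])

for f_test in (1100, 8000):
    b = np.argmin(np.abs(freqs - f_test))
    verdict = "masked -> discard" if spectrum_db[b] < mask_threshold[b] else "audible -> keep"
    print(f"{f_test} Hz: {verdict}")
```

The quiet tone at 1.1 kHz falls under the masker's skirt and can be thrown away; the equally quiet tone at 8 kHz is far from the masker and must be kept.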
The Haas Effect (Precedence Effect)
When two copies of the same sound arrive with a very small delay between them (roughly 1–35 milliseconds), the brain fuses them into a single perceived sound, localized toward the first-arriving source. Beyond approximately 35 ms, the delayed copy is perceived as a distinct echo.
Practical implication: Live sound engineers use the Haas effect to time-align delay speakers in large venues so audiences near the delay stacks still perceive sound as coming from the stage. In mixing, short pre-delays on reverbs can create a sense of space without the direct signal sounding wet.
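A minimal sketch of the delay-tower calculation, assuming 343 m/s for the speed of sound and a commonly cited 5–15 ms Haas offset (10 ms here) so the stage sound still arrives first:

```python
SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 °C

def delay_tower_ms(distance_m: float, haas_offset_ms: float = 10.0) -> float:
    """Delay to apply to a fill speaker `distance_m` behind the main stack.

    The acoustic travel time aligns the two arrivals; the extra
    haas_offset_ms (an assumed, commonly cited figure) keeps the stage
    arriving first so listeners localize toward it."""
    return distance_m / SPEED_OF_SOUND * 1000.0 + haas_offset_ms

print(f"{delay_tower_ms(40.0):.1f} ms")  # 40 m tower -> 126.6 ms
```

The offset stays well under the ~35 ms echo threshold, so the audience hears one fused sound anchored to the stage.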
Binaural Hearing and Sound Localization
Humans localize sound using two primary cues: interaural time difference (ITD, the tiny delay between the same sound arriving at each ear) and interaural level difference (ILD, the volume difference between ears, largely caused by head shadowing). Together these cues let the brain estimate a source's horizontal position; spectral filtering by the pinna resolves elevation and front/back ambiguity, since ITD and ILD alone are symmetric about the head (the "cone of confusion").
Binaural audio recording and processing (using dummy-head microphones or Head-Related Transfer Function modeling) attempts to recreate these cues for headphone listeners — the foundation of spatial audio technologies like Dolby Atmos for headphones.
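The ITD cue can be approximated with the classic Woodworth spherical-head model; the 8.75 cm head radius is an assumed average, and the formula is stated here for azimuths between 0° and 90°:

```python
import math

HEAD_RADIUS_M = 0.0875  # assumed average adult head radius
C = 343.0               # speed of sound, m/s

def itd_seconds(azimuth_deg: float) -> float:
    """Interaural time difference via the Woodworth spherical-head model.

    azimuth_deg: 0 = straight ahead, 90 = fully to one side.
    ITD = (a / c) * (theta + sin(theta)), theta in radians."""
    theta = math.radians(azimuth_deg)
    return (HEAD_RADIUS_M / C) * (theta + math.sin(theta))

for az in (0, 30, 60, 90):
    print(f"{az:>2} deg: {itd_seconds(az) * 1e6:5.0f} us")
```

The maximum, around 650–660 microseconds at 90°, is tiny, yet the auditory system resolves differences far smaller than that; HRTF processing must reproduce these cues accurately for convincing spatial audio.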
Why This Matters for Audio Equipment
Understanding psychoacoustics explains seemingly counterintuitive phenomena in audio equipment:
- Why some headphone tunings feel more "natural" than others despite measuring differently
- Why room acoustics affect perceived frequency balance so dramatically
- Why harmonic distortion from tube amplifiers is often described as "warm" or "musical"
- Why some lossless audio files sound indistinguishable from high-quality compressed files in controlled listening tests
Conclusion
Psychoacoustics bridges physics and human experience. The better you understand how perception shapes what you hear, the more effectively you can choose gear, treat recording environments, and make mixing decisions. Sound is ultimately an event that happens in the brain, and that is precisely what makes audio such a fascinating field.