A reference for what reviewers actually mean when they describe how a headphone or IEM sounds.
Built on Mark Ryan's SuperReview framework with additional terms and frequency context.
Why This Exists
The language of sound
Audiophiles use words like warm, bright, thin, and incisive to describe how gear sounds. These words can feel made-up, but they map to specific, measurable behaviors of how a headphone reproduces frequencies. Once the vocabulary clicks, reading reviews stops feeling like decoding poetry and starts feeling like reading specs.
This reference covers the most common terms, what they actually describe, and what to listen for when you encounter them.
Foundation
The frequency spectrum
Every audio term ties back to where on the frequency spectrum something is happening. Human hearing covers roughly 20 Hz to 20,000 Hz. Reviewers slice that range into bands, and most descriptions refer to one or more of them.
Sub-bass
20–60 Hz
Felt more than heard. Rumble, weight.
Bass
60–250 Hz
Kick drums, bass guitar punch.
Lower Mids
250–500 Hz
Body, warmth of male vocals.
Mids
500 Hz–2 kHz
Most vocals, guitar, piano.
Upper Mids
2–5 kHz
Vocal presence, snare crack.
Treble
5–10 kHz
Cymbals, hi-hats, S sounds.
Air
10–20 kHz
Shimmer, sparkle, openness.
A headphone's frequency response is a graph showing how loudly it reproduces each frequency. A perfectly flat response (every frequency at the same volume) is technically neutral but usually sounds dull. Most headphones intentionally emphasize or de-emphasize certain bands. That emphasis pattern is what creates a "sound signature."
Sound Signatures
The five shapes you'll encounter
Most headphones fall into one of these signature families. The frequency response curves below show what each one looks like compared to a flat reference (dashed line).
Neutral / Reference
Balanced
A roughly flat response with small, controlled deviations. Aims to reproduce music as the engineers heard it during mastering.
Examples: Sennheiser HD 600, Etymotic ER series. Honest. Not flashy.
Warm
Bass-tilted
Elevated bass, slightly rolled-off treble. Fuller, richer, more "body" in the lower frequencies. Less contrast between instruments.
Examples: Sennheiser IE 600, ZMF Aeolus. Inviting. Forgiving of bad recordings.
Bright
Treble-tilted
Recessed bass, elevated treble. Vocals and high-frequency detail get pushed forward. More clarity, more space, more contrast.
Examples: Beyerdynamic DT 1990 Pro, Sennheiser HD 800 S. Analytical. Can be fatiguing.
V-Shape
Smile curve
Bass boosted, treble boosted, mids scooped out. Exciting at first listen. Loses vocal body and midrange detail.
Examples: Beats, many consumer headphones. Fun for parties. Fatiguing for long sessions.
Dark
Heavily warm
An exaggerated warm signature with substantially rolled-off treble. Bass-heavy, smooth, can feel "veiled" or congested.
Examples: Audeze LCD-2, some vintage tunings. Relaxing. Not for detail work.
Thin
Hollow mids
A scoop in the lower mids and upper bass leaves vocals and instruments feeling "hollow" or weightless. Different from bright — sub-bass can still be present.
Examples: Many cheap earbuds, some studio monitors. Lacking body. Vocals lose weight.
Bass Vocabulary
How reviewers describe low frequencies
Bass is the easiest range to describe because we feel it physically. But the words people use carry specific meanings.
Sub-bass
The deepest range, roughly 20-60 Hz. Felt more than heard. Provides rumble, weight, the physical thump of a kick drum or synth bass.
Listen for: Movie explosions, electronic bass drops, 808s. If you feel pressure in your chest, that's sub-bass.
Mid-bass
Roughly 60-250 Hz. Where most bass guitar notes and kick drum body live. Affects the warmth and fullness of male vocals too.
Listen for: Bass guitar lines, kick drum punch. This is the range that gives music its sense of body.
Bass Bloat
Too much mid-bass. Crowds out vocals, reduces separation between instruments, makes everything feel "smeared" together. Not the same as sub-bass emphasis, which is cleaner.
Listen for: Bass that drowns out other elements. Following the bass line becomes hard because there's no definition.
Slam
The physical impact of bass hits. A headphone with good slam delivers kick drums and bass drops with visceral force. Closely related to "punch."
Listen for: Kick drum hits. Does it feel like a solid thump or a soft pat?
Tight / Loose
How quickly bass starts and stops. Tight bass has clear leading and trailing edges. Loose bass smears together and feels sloppy or "boomy."
Listen for: Fast bass guitar runs. Can you hear each note distinctly or do they blur together?
Texture
The detail and complexity within bass notes. Good bass texture reveals fingerwork on bass strings, the resonance of the instrument body, the room the recording happened in.
Listen for: Acoustic bass on jazz recordings. You should hear the wood, the string, the breath of the instrument.
Midrange Vocabulary
The most important range
Most of the music you care about lives in the midrange. Vocals, guitar, piano, the body of nearly every instrument. Reviewers tend to be particularly precise about midrange descriptions because this is where headphones are easiest to get wrong.
Forward Vocals
The singer feels close, present, "in front of" the rest of the band. Created by emphasis in the upper mids around 2-4 kHz. Good for vocal-centric music.
Listen for: Does the vocalist feel like they're singing to you or behind a curtain of instruments?
Recessed Vocals
The singer sits back in the mix, behind the instruments. Created by a dip in the upper mids. Some people prefer this for orchestral or instrumental music where the voice shouldn't dominate.
Listen for: Vocals that feel further away than the guitar or drums.
Lush
Rich, full, slightly thick midrange. Notes feel substantial. Often paired with a slight warm tilt. Associated with tube amps and certain dynamic driver headphones.
Listen for: Vocals and acoustic guitars that feel "fed" and complete rather than wispy.
Smooth
No harsh peaks in the midrange. Vocals slide easily without any frequencies jumping out unpleasantly. Different from "rolled off" — detail can still be there, just not aggressive.
Listen for: Female vocals on emotional ballads. Do any notes make you flinch?
Honky / Nasal
A peak around 1-2 kHz that makes vocals sound like the singer is plugging their nose. Unpleasant coloration that affects most vocal music.
Listen for: Does the vocalist sound congested even though you know they're not?
Body / Weight
The sense of substance behind notes. A headphone with good body makes a cello sound like a cello — heavy, resonant. Without body, instruments feel hollow.
Listen for: String instruments. Does the cello feel like a piece of furniture or a violin pretending?
Treble Vocabulary
The detail and air range
Treble is where headphones differentiate themselves on perceived "detail." It's also where they cause the most listening fatigue when done wrong. The line between exciting and tiring is thin.
Detail / Resolution
The ability to hear small elements in a recording — reverb tails, breath sounds, finger noise on guitar strings. Often comes from treble emphasis, but real resolution is about driver speed and clarity throughout the range.
Listen for: Things you've never noticed in songs you know well. Background instruments, room ambiance.
Incisive
Sharp, defined leading edges on percussive sounds. A kick drum isn't just a low frequency — it's the thwack of the beater hitting the head. Incisiveness is how clearly you hear that attack.
Listen for: Snare drum cracks, hi-hat hits. Do they have a defined edge or sound smeared?
Sparkle
A pleasant emphasis in the upper treble (8-12 kHz) that adds shimmer to cymbals and air to vocals. The opposite of dark or rolled-off.
Listen for: Cymbal decay. Does it shimmer and float or just stop dead?
Air
Energy in the highest treble (10-20 kHz). Creates a sense of openness and space. Hard to hear directly but you notice when it's missing — music feels closed-in.
Listen for: The space around instruments, the sense of a recording venue's atmosphere.
Sibilance
Harsh, exaggerated emphasis on S and T sounds in vocals. Created by peaks around 5-8 kHz. Makes singers sound like they're hissing at you. Common pitfall in bright headphones.
Listen for: Words with multiple S sounds. "She sells seashells" should not hurt.
Harsh / Piercing
Treble that crosses from exciting into painful. Often caused by narrow, aggressive peaks in the 4-8 kHz range. Causes listener fatigue within minutes.
Listen for: Cymbal crashes that make you want to turn down the volume.
Spatial Vocabulary
Soundstage, imaging, and the illusion of space
Headphones can't actually create a soundstage the way speakers can. What they do is simulate spatial cues through careful timing, frequency response, and driver design. These terms describe that simulation.
Soundstage
The overall sense of space the music occupies. A wide soundstage feels like a large room or concert hall. A narrow soundstage feels like the music is between your ears. Open-back headphones generally have larger soundstages than closed-backs.
Listen for: Live recordings. Do you feel like you're in the venue or staring at the speakers?
Imaging
The ability to pinpoint exactly where each instrument is positioned in the stage. Sharp imaging means you can close your eyes and point to where the snare drum is in 3D space.
Listen for: Hard-panned instruments. Can you place them precisely or is the position fuzzy?
Separation
How distinct individual instruments and elements feel from each other. Good separation lets you focus on one element at a time. Poor separation smears everything together.
Listen for: Dense mixes. Can you follow the bass line while the guitar solos, or does it blur?
Depth
The front-to-back dimension of the soundstage. Some instruments feel closer, others further away. A headphone with good depth creates layers, not just left-right placement.
Listen for: Vocal vs. background harmonies. Do the backups feel set back behind the lead?
Holography
When imaging is so precise that instruments feel like they have physical, three-dimensional presence in space. Rare term, usually reserved for high-end gear.
Listen for: The sense that you could walk around an instrument and see its other side.
Intimate
A small, close soundstage where everything feels nearby. Not necessarily bad — works well for vocal-focused recordings, jazz trios, anything where you want to feel "in the room" rather than "in the hall."
Listen for: Studio vocal recordings. Does it feel like the singer is two feet away?
Other Useful Terms
The rest of the vocabulary
Terms that don't fit neatly into a frequency band but come up constantly in reviews.
Timbre
How accurately a headphone reproduces the natural tonal character of an instrument. Good timbre means a violin sounds like a violin, not a synthesized approximation of one. Bad timbre is the audiophile equivalent of uncanny valley.
Coloration
Any deviation from neutral. All headphones color the sound — the question is whether the coloration is pleasant or distracting. A "highly colored" headphone is far from neutral; an "uncolored" headphone is close to flat.
Speed
How quickly the driver responds to signal changes. Fast drivers (often planar or electrostatic) handle complex passages cleanly. Slow drivers can sound congested on busy music.
Decay
How a note fades out after it's played. Natural decay sounds like the instrument in real life — a cymbal ringing out, a piano string still vibrating. Artificial or truncated decay sounds clipped.
Musical vs. Analytical
A loose dichotomy. "Musical" gear prioritizes enjoyment and emotional engagement over strict accuracy. "Analytical" gear prioritizes detail and accuracy, sometimes at the cost of musical flow. Most listeners want something in between.
Coherent
A sense that all frequencies are arriving together as a unified whole, rather than feeling like separate drivers handling different ranges. Single-dynamic-driver IEMs often score high on coherence; some multi-driver hybrids can feel "disjointed."
Engaging
A vague but real quality. An engaging headphone makes you want to keep listening. Often comes from good dynamics, rhythmic drive, and emotional honesty rather than measurable specifications.
Veiled
As if a cloth is draped over the music. Reduced clarity in the upper mids and treble. Often paired with criticism of dark or warm headphones taken too far.
Congested
Music feels crowded, with poor separation between elements. Often caused by bass bloat encroaching on the mids, or by drivers that can't handle complex passages.
Applied Knowledge
Translating preferences into purchases
Once you know what you actually want, the right keywords help you filter reviews and find gear that matches.
Avoid: sibilant, peaky, harsh treble, hot upper mids, narrow peaks, V-shape (for long sessions)
One last thing. Reviewers are subjective. Two people listening to the same headphone with the same music can come to different conclusions about whether it's "warm" or "neutral." The vocabulary is a tool for communication, not an exact science. The best use of these terms is to read multiple reviews from people whose taste you've come to understand, and triangulate from there.