
A spectrogram is a visual map of a sound, with time across the bottom, frequency up the side, and colour showing level at every point. Where a normal analyzer shows you the frequency balance right now, a spectrogram shows the whole arc of a sound over seconds or minutes, so you can see a hi-hat's shimmer, a bass note's sustain, and a stray 60 Hz hum all at once. It's the closest thing audio has to an X-ray.
Every spectrogram encodes three dimensions of information into a flat image. Once you internalise the three axes, the picture reads instantly:
That third axis is the whole point. A waveform tells you how loud the signal is overall; a spectrogram tells you which frequencies make up that loudness and how they move. Two completely different sounds can share an identical waveform shape and look totally different on a spectrogram.
Sounds leave recognisable fingerprints. Learn these five and you can diagnose a mix by eye:
| What you see | What it is | What to do about it |
|---|---|---|
| A bright, steady horizontal line | A sustained pitch or a tonal resonance | If unwanted, notch it with a narrow EQ cut |
| A thin horizontal line at 50/60 Hz and its multiples | Electrical hum from a ground loop | Fix at source or use a hum remover / high-pass |
| Vertical streaks top to bottom | Transients: drum hits, plosives, clicks | Normal for percussion; isolated clicks may be edits |
| A dense smear along the bottom | Low-end mud or rumble | High-pass non-bass tracks, clean up the sub |
| Fuzzy haze filling the high end | Hiss, noise, or lossy-codec artefacts | Denoise, or check you started from a lossless file |
The skill is pattern recognition, not maths. After a few sessions you'll glance at a vocal and immediately see the sibilant flares around 6 to 9 kHz, the breath noise between phrases, and whether the low end is clean or clouded with proximity boom.
They're cousins, and people mix up the names. A spectrum analyzer (the bouncing curve in your EQ) shows the frequency balance at a single instant, level on the Y axis, frequency on the X. A spectrogram rotates frequency onto the Y axis and uses the X axis for time, so it shows how that balance evolves. The analyzer answers 'what's loud right now?'; the spectrogram answers 'what happened across the whole song?'. You want both, for different jobs: the analyzer for live tone-shaping, the spectrogram for hunting problems that only show up over time.
It's not eye candy. A spectrogram solves specific, recurring problems that are hard to catch any other way:
That last point about the MP3 fingerprint is worth dwelling on. If someone sends you a 'WAV' and the spectrogram shows a brick-wall cliff at 16 kHz, the audio was lossy at some point in its life and re-wrapped, no amount of processing brings back what the encoder threw away. It's a strong argument for insisting on the original lossless file. When you hand off your own work, sending the real export through a delivery Room keeps that high end intact instead of letting a chat app re-encode it down to a cliff.

Brightness tells you relative level frequency-by-frequency, which is perfect for balance decisions. What it doesn't give you is a calibrated loudness number for delivery, the integrated LUFS your streaming platform cares about. For that you need a loudness meter, not a colour scale. Use the spectrogram to fix what's loud, and a LUFS meter to decide how loud the finished file should be.
The two views are complementary. A spectrogram might reveal a harsh 3 kHz buildup that's making the track feel louder and more fatiguing than the meter suggests; tame it, and you can often push the overall level higher before it gets harsh. Problems you can see are problems you can fix with intent rather than guesswork, the same reason engineers also keep an eye on transients when shaping punch.
Read time left-to-right, frequency bottom-to-top (bass low, treble high), and brightness as loudness (dark is quiet, bright is loud). Horizontal lines are sustained tones, vertical streaks are transient hits, and a smear at the bottom is usually low-end mud.
A spectrum analyzer shows the frequency balance at a single moment (frequency on X, level on Y). A spectrogram adds time as the X axis and puts frequency on the Y axis, with colour for level, so it shows how the frequency content changes over the whole track.
A bright or hot-coloured region. A sustained loud tone appears as a bright horizontal line at its frequency; a loud transient appears as a bright vertical streak spanning many frequencies at one instant.
Often, yes. Lossy encoders cut everything above a sharp ceiling, usually around 16 to 20 kHz. A hard horizontal cliff at the top of the spectrogram with nothing above it is a strong sign the audio was once lossy, even if it's now wrapped as a WAV.
A sustained single frequency. Musically that's a held note or a tonal pad; as a problem it's a resonance, feedback, or electrical hum (a thin line at 50 or 60 Hz and its multiples).