
Stems are audio files that each hold a group of related tracks bounced down together — every drum on one file, every vocal on another, the instruments on a third — and they add back up to your complete mix. Think of a song split into a handful of labelled layers instead of one finished stereo file or sixty raw channels. They exist so that someone else, or future you, can re-balance, remix, or master the parts without needing your original session.
A stem is what you get when you take a group of channels that belong together — say, your kick, snare, hats, toms, and drum-bus compressor — and bounce just that group to a single stereo file. Solo the group, print it, and you have a drum stem. Do the same for bass, vocals, and instruments, and a 48-channel session collapses into five or six tidy files.
The defining property: the stems must sum back to the mix. Drop all of them into an empty session, line them up at the same start point, leave every fader at 0 dB, and they should play back as the exact mix you bounced — same balance, same effects, same level. If they don't add up, they aren't stems; they're just some bounces. That single rule is what separates a usable stem set from a folder of random exports.
This is where most of the confusion lives, and it matters because asking a client for "the stems" when you need the multitracks will cost you a day. Here's the whole hierarchy in one view.
| Term | What it is | Typical count | Used for |
|---|---|---|---|
| Track | One channel — a single mic, DI, or MIDI part | dozens per song | Building the mix |
| Multitrack | Every individual track exported on its own | 20–100+ | Full remix from scratch, archiving |
| Stem | A group of tracks (e.g. all drums) bounced to one file | 4–12 | Stem mastering, remix packs, sync, live |
| Master | The final mix bounced to one stereo file | 1 | Distribution and streaming |
So a track is the smallest unit and the master is the largest. Stems sit one level below the master: fewer files than multitracks, each already balanced and processed the way the mix engineer intended. Multitracks are raw and granular — you'd hand those over only when someone needs to rebuild the mix completely.
Stems aren't busywork. They're the format you reach for whenever the balance between parts needs to stay editable after the mix is done:

There's no fixed number — it depends on who's receiving them and why — but most songs reduce cleanly to four or five groups. This is the standard breakdown, and what each group usually contains:
| Stem | What's usually in it |
|---|---|
| Drums | Kick, snare, hats, toms, cymbals, percussion, and drum-bus processing |
| Bass | Bass guitar or synth bass, the sub, and any bass-specific compression or saturation |
| Vocals | Lead, doubles, harmonies, and ad-libs (often split into lead vs backing) |
| Music / instruments | Guitars, keys, synths, pads, strings — the harmonic and melodic bed |
| FX / other | Risers, impacts, foley, and reverb or delay throws printed on their own |
Scale up or down from there. A sparse singer-songwriter track might be three stems; a dense pop production handed to a sync library might be ten or more, with vocals split into lead, backing, and ad-libs and a separate instrumental and a cappella. When in doubt, ask the recipient for their deliverables list — sync libraries and mastering engineers usually have one.
Stems are delivered as lossless WAV or AIFF, never MP3. Match the session: if you tracked at 48 kHz / 24-bit, your stems are 48 kHz / 24-bit. The whole point of a stem is that the next person processes it further — mastering, re-balancing, time-stretching — and lossy compression throws away exactly the detail that processing exposes. An MP3 stem is a corrupted starting point, and you can't un-bake it.
A clean stem set lines up instantly and sounds identical to the mix. Most stem problems are avoidable; this is the checklist that prevents the email that starts with "these don't line up":
Aim for around -6 dBFS of peak headroom on stems destined for mastering. That gives the mastering engineer room to work and keeps any single stem from clipping when they're summed. Check it on the way out instead of guessing:
Stem mastering is mastering from a small number of stems instead of a single stereo file. It sits between mixing and mastering: the engineer can correct balance issues a stereo master can't touch — pull a hot vocal back, add weight to the bass, widen the music — while still doing the loudness, tonal, and dynamics work of a normal master. It costs more and takes longer than stereo mastering, but it's the move when the mix is close but not quite locked. If you're prepping a stem master yourself, set your targets first:
The exports are the easy part. The mess is everything after: a zip in a chat thread, a re-send because the vocal was named wrong, a client who can't tell the v3 stems from v2, and an mp3 "preview" floating around before anyone's paid. Stems are big, they get versioned, and they often gate a payment — which is exactly the workflow a delivery Room is built for: labelled files, a clean preview the client can hear, and the full-quality WAVs unlocked the moment the invoice clears.
Multitracks are every individual channel exported separately — each mic, DI, and MIDI part on its own file, often 20 to 100+ of them. Stems are groups of those tracks bounced together (all drums, all vocals), usually 4 to 12 files. Stems are already balanced and processed; multitracks are raw.
Most songs reduce to four or five: drums, bass, vocals, instruments, and FX. Sparse tracks may need only three; dense productions or sync deliverables can run ten or more with vocals split into lead, backing, and ad-libs plus a separate instrumental and a cappella. Ask the recipient for their deliverables list.
Lossless WAV or AIFF, matched to the session's sample rate and bit depth (commonly 48 kHz / 24-bit). Never MP3 — stems get processed further downstream, and lossy compression destroys the detail that mastering and re-balancing rely on.
Yes — a stem captures the mix exactly as it sounds, including its reverb, delay, and processing, so the set sums back to the full mix. Always let the effect tails ring out past the last note when you export, or the stem won't line up with the mix.
Yes — it's called stem mastering. Working from 4 to 6 stems lets the mastering engineer fix balance issues a stereo master can't reach, like a loud vocal or thin bass, while still doing normal loudness and tonal work. It costs more and takes longer than stereo mastering.