
Remove music from video but keep voice
Removing music from a video while keeping the voice is really about protecting speech quality, not chasing total silence.
The same applies when you want to remove background music or a soundtrack from a video: a stem-balance approach, where you separate the audio into stems and rebalance them, usually sounds more natural than a hard removal.
Why “keep voice” is the real intent
Hard removal can leave:
thin, phasey speech
“underwater” tails after words
unnatural gaps where the music had been masking room noise
A better target is voice forward with the music bed low, unless you truly need a voice-only track.
The stem-balance method (simple and reliable)
Step 1: Separate, don’t delete
Start with AI Music Separator so you can control the mix instead of committing to a hard mute.
Step 2: Build a voice-first mix
A practical starting point:
Voice track: bring it up to comfortable speech level
Music/instrumental: bring it down until it’s present but not competing
If it still masks words: lower music a bit more (don’t over-process the voice)
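The voice-first balance above can be sketched as a per-stem gain followed by a sum. This is a minimal illustration, not the method any particular separator uses: the function names (`db_to_gain`, `mix_voice_first`) are hypothetical, the stems are assumed to be equal-length mono lists of float samples in the range -1.0 to 1.0, and the -18 dB music default is only an assumed starting point to adjust by ear.

```python
def db_to_gain(db: float) -> float:
    """Convert a level change in decibels to a linear amplitude multiplier."""
    return 10.0 ** (db / 20.0)


def mix_voice_first(voice, music, voice_db=0.0, music_db=-18.0):
    """Apply a gain to each stem, sum them, and hard-limit to [-1.0, 1.0].

    voice, music: equal-length sequences of float samples (assumed mono).
    voice_db, music_db: gain per stem in dB (illustrative defaults only).
    """
    if len(voice) != len(music):
        raise ValueError("stems must have the same length")

    voice_gain = db_to_gain(voice_db)
    music_gain = db_to_gain(music_db)

    mixed = []
    for v, m in zip(voice, music):
        sample = v * voice_gain + m * music_gain
        # Hard limit so the summed signal never exceeds full scale.
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed


# If the music still masks words, lower music_db further
# instead of adding processing to the voice stem.
quieter = mix_voice_first([0.2, -0.3], [0.5, 0.5], music_db=-24.0)
```

Keeping the adjustment on the music stem matches the advice above: the voice stays untouched while the bed moves down.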
Step 3: Export both versions
Voice-only (utility track)
Voice-first mix (publish track)
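As a rough sketch of exporting both versions, the snippet below writes each track as a 16-bit mono WAV file using only Python's standard library. The helper names (`write_mono_wav`, `export_both`), the file naming scheme, and the 48 kHz sample rate are assumptions for illustration; samples are assumed to be floats in the range -1.0 to 1.0.

```python
import struct
import wave


def write_mono_wav(path, samples, sample_rate=48000):
    """Write float samples in [-1.0, 1.0] as a 16-bit mono PCM WAV file."""
    frames = bytearray()
    for s in samples:
        clipped = max(-1.0, min(1.0, s))
        # Scale to the signed 16-bit range and pack as little-endian.
        frames += struct.pack("<h", int(round(clipped * 32767)))

    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)   # mono
        wf.setsampwidth(2)   # 2 bytes = 16-bit
        wf.setframerate(sample_rate)
        wf.writeframes(bytes(frames))


def export_both(voice, mix, stem="clip"):
    """Save the voice-only utility track and the voice-first publish track."""
    voice_path = f"{stem}_voice_only.wav"
    mix_path = f"{stem}_voice_first_mix.wav"
    write_mono_wav(voice_path, voice)
    write_mono_wav(mix_path, mix)
    return voice_path, mix_path
```

Writing both files from the same session keeps the utility track available for later edits without repeating the separation.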
When to use a simpler tool
If your audio is mostly “voice + one music bed,” a quick split can be enough:
Use Vocal Remover when the “voice” is sung vocals rather than speech.
Use Music Separation for fast previews.