
Remove music from video
Removing music from a video can mean two different jobs: lowering a music bed so the speech comes through, or separating the audio into voice and music stems.
Whether you want to turn down the background music or take it out entirely, start with a fast preview and choose the least destructive method.
What you can (and can’t) remove
You can usually do well when:
The voice is loud and centered
The music is a simple bed behind speech
The recording isn’t heavily echoed
You may struggle when:
Voice and music are equally loud
The room is very reverberant
The “voice” is actually multiple people plus crowd noise
In those cases, pushing for “perfect removal” often sounds worse than a clean voice-first mix.
The fastest workflow (preview first, commit later)
Step 1: Run a quick split
Use Music Separation to generate a fast voice vs music preview.
Step 2: Decide your target output
Want publish-ready dialogue? Keep a little bed, just quieter.
Want voice-only for subtitles/ADR? Export voice-only and clean lightly.
Step 3: If the preview is messy, switch to control mode
Use AI Music Separator when you need more balancing options instead of hard removal.
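To make the "keep a little bed, just quieter" option concrete, here is a minimal sketch of a voice-first mix. It assumes you already have the voice and music stems as equal-length lists of float samples in the range -1.0 to 1.0. The function names and the -12 dB default are illustrative assumptions, not settings taken from Music Separation or AI Music Separator.

```python
def db_to_gain(db: float) -> float:
    """Convert a level change in decibels to a linear amplitude factor."""
    return 10 ** (db / 20)


def voice_first_mix(voice, music, music_db=-12.0):
    """Mix two equal-length stems, turning the music down by music_db.

    voice and music are lists of float samples in [-1.0, 1.0].
    music_db is a hypothetical default; tune it by ear.
    """
    if len(voice) != len(music):
        raise ValueError("stems must have the same number of samples")
    gain = db_to_gain(music_db)
    mixed = [v + gain * m for v, m in zip(voice, music)]
    # Hard-limit each sample to the valid [-1.0, 1.0] range.
    return [max(-1.0, min(1.0, s)) for s in mixed]
```

A value around -12 dB leaves the bed audible but well behind the speech; going much lower approaches hard removal, which is where separation artifacts tend to become noticeable.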
Quick clarity checks (about 30 seconds)
Phone speaker test: can you understand every sentence?
Listen for watery or metallic artifacts on “S” and “T” sounds
Check volume jumps when music hits
If artifacts show up, back off the removal strength and mix the voice over a quieter music bed instead.
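The volume-jump check can also be roughed out in code if you prefer a number to a listening pass. The sketch below is one possible approach, not how any of the tools above work: it measures the RMS level of consecutive windows and flags any window that is much louder than the one before it. The 6 dB threshold and the function names are assumptions chosen for illustration.

```python
import math


def rms_db(window):
    """Return the RMS level of a list of float samples, in dB relative to 1.0."""
    mean_square = sum(s * s for s in window) / len(window)
    # Floor the value so pure silence does not produce log10(0).
    return 20 * math.log10(max(math.sqrt(mean_square), 1e-9))


def find_volume_jumps(samples, window_size, jump_db=6.0):
    """Return indices of windows that are at least jump_db louder than the previous one."""
    levels = [
        rms_db(samples[i:i + window_size])
        for i in range(0, len(samples) - window_size + 1, window_size)
    ]
    return [i for i in range(1, len(levels)) if levels[i] - levels[i - 1] >= jump_db]
```

Multiply a flagged index by the window size and divide by the sample rate to get the time in seconds where you should listen again.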
The “don’t waste time” rule
If your clip is long, trim it first:
Use Audio Cutter to isolate the exact section you need.
Then process only that part.
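If you would rather trim from a script than with Audio Cutter, the same "cut first, process second" idea can be sketched with the ffmpeg command-line tool, which is a substitute here and not part of the workflow above. The helper below only builds the argument list; the function name and the file names in the usage note are hypothetical.

```python
def build_trim_command(src, dst, start, duration):
    """Build an ffmpeg argument list that copies one section of src into dst.

    start and duration may be seconds (e.g. 90) or timestamps (e.g. "00:01:30").
    """
    return [
        "ffmpeg",
        "-ss", str(start),     # seek to the start point in the input
        "-i", src,
        "-t", str(duration),   # keep only this much material
        "-c", "copy",          # copy the streams without re-encoding
        dst,
    ]
```

With ffmpeg installed, you could run the result with subprocess.run(build_trim_command("talk.mp4", "clip.mp4", "00:01:30", 45), check=True). Because stream copy does not re-encode, the cut may snap to the nearest keyframe, so leave a second or two of margin on each side.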