AudioUtils

How to Extract Audio from Video Files

Extract audio tracks from video files and save as MP3, WAV, or other formats. Step-by-step guide for any video format.

Every video file already contains the audio you want as a separate stream — extracting it is a question of pulling that stream out, not re-recording or re-mixing anything. The right method depends on the source container, what you plan to do with the audio next, and whether you can afford an extra round of lossy re-encoding. This guide covers every scenario, every common source format, and the exact decisions that determine the quality of your output.

Why People Extract Audio From Video

The use cases are broader than most people realize:

  • Lectures and webinars recorded as MP4 or MOV that students want to re-listen to during commutes, at 1.5x or 2x speed in a podcast app.
  • YouTube tutorials saved as offline video that work better as audio in a car or on a run.
  • Conference talks and interviews filmed but consumed audio-first.
  • Voice memos recorded on iPhone in QuickTime that arrive as .mov files but are functionally voice notes.
  • Podcasts recorded on camera (most modern podcasts have a video version) where the host needs to publish an audio-only feed alongside YouTube.
  • Music videos and live performances where the official audio release is unavailable.
  • Screen recordings from OBS, ScreenFlow, or QuickTime where only the narration matters.
  • Transcription prep — every transcription service (Otter, Rev, Whisper, Descript) accepts audio more efficiently than video and many charge by file size.
  • Sampling and music production where producers pull short audio segments from films, interviews, or live recordings as source material for tracks. Once extracted, cut the audio down to the exact clip you need before importing into your DAW.

In each case, the underlying operation is the same: separate the audio elementary stream from the video container.

What Is Actually Inside a Video File

A video file is a container — a wrapper format that holds one or more video streams, one or more audio streams, optional subtitle tracks, chapter markers, and metadata. The container and the codecs inside it are independent decisions:

  • MP4 (.mp4) — almost always carries AAC audio. Sometimes MP3, occasionally AC-3 for theatrical content.
  • MOV (.mov) — Apple's QuickTime container. Typically AAC for iPhone recordings; can carry PCM (uncompressed), ALAC, or AC-3 for professional workflows.
  • MKV (.mkv) — Matroska. The flexible one. Audio can be AAC, AC-3, DTS, FLAC, Vorbis, Opus, PCM, or TrueHD. MKV files of films often carry multiple audio tracks (English, foreign dubs, commentary).
  • WebM (.webm) — Google's web container. Audio is Vorbis or Opus, never AAC.
  • AVI (.avi) — legacy Microsoft container. Usually MP3 or PCM audio.
  • MTS / M2TS (.mts, .m2ts) — AVCHD, the format consumer camcorders and Blu-ray use. Audio is AC-3 or LPCM.
  • FLV (.flv) — Flash Video. Almost always AAC or MP3 audio. Rare in 2026 but still surfaces in old archives.

When you 'extract audio,' you are demuxing — pulling the audio stream out of the container. Whether re-encoding happens depends entirely on whether your target format matches the source codec.

Three Approaches: Browser, Desktop, Command-Line

There are three categories of tools, with very different trade-offs.

Browser-based (recommended for most users)

Tools like AudioUtils run FFmpeg compiled to WebAssembly directly in your browser tab. The video file is loaded into browser memory, processed locally, and the audio is downloaded back to your device. The file never touches a server. This matters for unreleased lectures, internal company recordings, interviews under NDA, or any video you would not upload to a random web service.

Workflow: drag the video into a converter like /mp4-to-mp3, /mov-to-mp3, /mp4-to-wav, or /mov-to-wav. Pick a bitrate. Download the result.

Strengths: no install, no upload, works on iPhone/Android/Chromebook/locked-down work computers, identical workflow on every OS. Limits: practical file size around 1-2 GB before browser memory pressure becomes an issue; not a great fit for batch processing 200+ files.

Desktop applications

  • HandBrake — primarily a video transcoder but can output audio-only. Overkill for a single extraction; useful when you also want to re-encode the video.
  • VLC — Media > Convert/Save will demux any container VLC plays. Free, every OS, but its conversion UI is awkward.
  • Audacity — opens MP4/MOV via its FFmpeg plugin. Best when you also want to edit, denoise, or normalize the audio.
  • Permute (macOS) — drag-and-drop, $10. Solid pick for Mac users who do this regularly.

Command-line: FFmpeg

The professional answer. FFmpeg is the engine behind virtually every other tool in this list (including the browser-based ones). It is free, scriptable, and handles every codec and container in existence.

Step-by-Step: Extracting Audio With AudioUtils

For a typical YouTube-downloaded MP4 or iPhone MOV file:

1. Open /mp4-to-mp3 for MP4 or /mov-to-mp3 for MOV in your browser. 2. Drag the video file into the converter. Files up to about 500 MB load smoothly on most computers; larger files may need a few seconds of patience. 3. Choose a bitrate. 192 kbps is the sweet spot for most extracted speech and music. 128 kbps is fine for voice-only content. 320 kbps for music you want to keep at the highest practical MP3 quality. 4. Click Convert. Processing takes 5-30 seconds depending on the source length and your CPU. 5. Download the .mp3.

For lossless intermediate output — for example, if you plan to load the audio into a DAW for editing — use /mp4-to-wav or /mov-to-wav instead. The output WAV is decoded PCM, ready for any editing software.

For Apple-ecosystem use where you want to keep the original AAC quality without an MP3 transcode, use /mp4-to-m4a. When the source is MP4 with AAC inside, this is a stream copy — no re-encoding happens, and the output is byte-identical AAC in an M4A container.

Output Format Decision Tree

Once you have the audio out of the video, the container choice depends on what you are going to do with it:

  • MP3 — sharing, sending to friends, uploading to a podcast host, listening on any device. Universal compatibility, manageable file size, lossy.
  • WAV — editing in a DAW, transcription with services that prefer PCM, archival for short clips, sampling. Uncompressed, large.
  • M4A (AAC) — Apple ecosystem, AirPods, anywhere you want better quality per byte than MP3. Especially efficient when the source video already has AAC audio (stream copy avoids transcoding entirely).
  • FLAC — long-term archival of the extracted audio. Lossless compression, half the size of WAV.
  • OGG / Opus — web playback, Discord, game development. Most efficient lossy codec available; not universally supported on legacy hardware.

Quality Preservation: When Extraction Is Lossless

This is the most important point in the article and the one most tutorials get wrong.

Extracting AAC audio from an MP4 file to an M4A file is lossless. No re-encoding happens. The audio bytes are copied from the MP4 container into an M4A container with their codec parameters preserved. The output is bit-identical to the audio that was inside the source video.

Extracting that same AAC to MP3 is lossy. The audio is decoded from AAC to PCM, then re-encoded to MP3. You incur the artifacts of two lossy codecs in series — once when the original video was created, again when you convert to MP3. At 192 kbps target MP3 from a 128 kbps AAC source, the result is audibly OK for casual listening but noticeably worse than the AAC original.

The rule: match codecs when possible. If the source has AAC, output AAC (M4A). If the source has Vorbis, output OGG. Only transcode when you need a specific format the recipient or platform requires.

FFmpeg One-Liners (Every Flag Explained)

For users comfortable on the command line, FFmpeg gives you exact control.

Stream copy AAC from MP4 (lossless, fastest): 'ffmpeg -i input.mp4 -vn -c:a copy output.m4a'. The flags: '-i input.mp4' specifies the input. '-vn' means 'no video' — drop the video stream. '-c:a copy' means 'audio codec: copy' — copy the audio bytes without re-encoding. Output is an M4A with the original AAC bytes. Process time: a few seconds for a 1-hour file.

Transcode to MP3 at 192 kbps (lossy but universal): 'ffmpeg -i input.mp4 -vn -c:a libmp3lame -b:a 192k output.mp3'. '-c:a libmp3lame' uses LAME, the gold-standard MP3 encoder. '-b:a 192k' sets bitrate to 192 kbps CBR. Use '-q:a 2' instead of '-b:a' for VBR (better quality at similar average rate).

Extract to WAV (lossless PCM): 'ffmpeg -i input.mp4 -vn -c:a pcm_s16le output.wav'. 'pcm_s16le' is 16-bit signed little-endian PCM, the standard CD-quality WAV format. Use 'pcm_s24le' for 24-bit if your source has higher bit depth.

Pick a specific audio track from a multi-track MKV: 'ffmpeg -i input.mkv -map 0:a:0 -c:a copy output.m4a'. '-map 0:a:0' means 'from input 0, take audio stream index 0.' Use '0:a:1' for the second audio track (commentary, foreign dub). Run 'ffprobe input.mkv' first to see what tracks exist.

Batch Conversion

For more than a handful of files, scripting beats clicking. A simple shell loop processes every .mp4 in a folder to a same-named .mp3 with the LAME encoder at 192 kbps. Replace the codec or bitrate as needed. For batch jobs above a hundred files, FFmpeg is dramatically faster than browser tools because it can process files in parallel and skip the browser-memory ceiling.

Multiple Audio Tracks

Films and concerts on MKV often carry several audio streams: original language, dubs, director's commentary, isolated music. Browser tools typically grab the default (track 0). To pick a specific track:

  • VLC: Audio menu > Audio Track > pick before exporting.
  • FFmpeg: '-map 0:a:N' where N is the track index. Use 'ffprobe' to list tracks.
  • HandBrake: Audio tab > Track dropdown shows all available streams.

Preserving Timestamps for Sync Workflows

If you are extracting audio for transcription that you will sync back to the video later, frame-accurate timing matters. Stream copy ('-c:a copy') preserves the original sample-accurate timing. Transcoding to MP3 introduces a small encoder delay (LAME adds about 576 samples / 13 ms of leading silence by default) that can offset transcripts. For transcription that must sync precisely, extract to WAV — PCM has no encoder delay.

File Size Expectations

Rough numbers for a one-hour source:

  • Original MP4 video (1080p, 5 Mbps video + 128 kbps audio): about 2.3 GB total
  • Extracted M4A (AAC stream copy): about 56 MB
  • Transcoded MP3 at 192 kbps stereo: about 86 MB
  • Transcoded MP3 at 128 kbps mono: about 56 MB
  • Extracted WAV (16-bit / 44.1 kHz stereo): about 605 MB
  • Extracted FLAC from the WAV: about 300 MB

The audio is always a tiny fraction of the original video file. Even uncompressed PCM audio is roughly 25% of a typical 1080p MP4.

Edge Cases

  • DRM-protected video (iTunes movie rentals, Netflix downloads, Disney+ offline files) cannot be extracted. The audio stream is encrypted at the container level. This is a hardware-enforced limit, not a tooling limit.
  • Corrupted or partially-downloaded videos sometimes have a recoverable audio stream even when the video stream is broken. FFmpeg's '-err_detect ignore_err' flag forces extraction past errors.
  • Variable frame rate (VFR) screen recordings from OBS or game capture can have audio sync drift. Extract to WAV first to bypass any container-level remapping.
  • Live-streamed videos saved as .ts (transport stream) or fragmented MP4 may have multiple audio segments concatenated; FFmpeg handles them correctly but some browser tools may stop at the first segment break.

Mobile Workflows

iPhone: open the video in the Files app, share to your browser, navigate to /mov-to-mp3 or /mp4-to-mp3, drop the file in, and download the audio back to Files. The whole operation works without installing anything because Safari supports WebAssembly and the Files app handles input and output.

Android: identical pattern with Chrome and the Files app or any file manager. Android also has native FFmpeg apps (FFmpeg Media Encoder, Termux + ffmpeg) for users who want command-line control.

For further format decisions, see the lossless vs lossy guide, and the audio bitrate explainer covers what bitrate to pick for the extracted file. If your source is specifically an MP4 from YouTube or another website, the extract audio from MP4 guide walks through the no-software workflow specifically. For background on why MP4 typically holds AAC audio, see what is AAC.

More to Read

How to Convert iPhone Voice Memo to MP3 FreeHow Audio Compression WorksBest Audio Format for WebsitesHow to Batch Convert Audio FilesDoes Converting MP3 to WAV Improve Quality?How to Convert MP3 to WAV for Music ProductionHow to Convert MP3 to WAV Without Losing QualityHow to Convert MP3 to WAV on Mac and WindowsHow to Convert WAV to MP3 Without Losing QualityWAV File Too Large? Convert to MP3How to Convert iPhone Voice Memo to MP3 FreeHow to Play M4A Files on Android (Convert to MP3)How to Convert FLAC to MP3 Without Losing QualityBest Bitrate for FLAC to MP3 ConversionConvert AAC to MP3: Best Quality SettingsHow to Extract Audio from MP4 FilesConvert iPhone MOV Video to MP3How to Convert WAV to MP3 (The Complete Guide)How to Convert MP3 to WAV for Editing and DAWsBest MP3 to WAV Settings for Editing and DAWsBest WAV to MP3 Bitrate for Music, Podcasts, and VoiceMOV to MP3 on Mac: Fastest Ways ComparedHow to Convert M4A to MP3 on iPhone Without a ComputerHow to Convert FLAC to MP3 on MacHow to Convert FLAC to MP3 on WindowsHow to Convert OGG to MP3 on MacHow to Convert MP4 to MP3 on MacHow to Convert MP4 to MP3 on iPhoneHow to Convert MP4 to MP3 on AndroidHow to Convert WMA to MP3 on MacHow to Convert AIFF to MP3 on MacHow to Convert MOV to MP3 on WindowsM4A to WAV: How to Convert and WhyHow to Convert FLAC to OGG VorbisHow to Convert AAC to WAV for EditingHow to Convert WMA to MP3 on WindowsHow to Convert AIFF to MP3 on WindowsHow to Convert OGG to MP3 on WindowsHow to Convert FLAC to MP3 on iPhoneHow to Convert AAC to MP3 on MacHow to Convert M4A to MP3 on Mac: 3 Easy MethodsHow to Convert Audio Files with AudacityHow to Convert Audio Files with VLCFLAC to AAC: Bitrate Guide and Practical StepsOGG to AAC: Cross-Platform Audio Migration GuideWMA to OGG: Escape the Windows Media EcosystemWMA to FLAC: Lossless Archiving of Your Old WMA LibraryFLAC to Opus: Web Streaming Optimization GuideAIFF to M4A: Apple Production Workflow GuideWAV to AIFF: Windows to Mac Audio WorkflowHow to Convert AAC to MP3 on iPhoneHow to Convert FLAC to MP3 on AndroidHow to Convert OGG to MP3 on AndroidHow to Convert WAV to MP3 on iPhoneHow to Convert AIFF to MP3 on iPhoneHow to Convert M4A to MP3 on WindowsOpus to MP3: Complete Conversion GuideConvert Audio on Linux: Command Line and Browser OptionsHow to Convert Audio Without Installing SoftwareHow to Convert WMA to MP3 on Mac (Step-by-Step Guide)OGG to FLAC: What to Expect from the ConversionAAC to FLAC: Convert and What to ExpectOpus to WAV: How to Convert and Why You Might Need ToWAV to Opus: The Web Developer's Audio GuideBest Audio Format for Speech-to-Text TranscriptionBest Audio Format for WhatsApp Voice MessagesAudio Formats Windows Media Player Plays NativelyAudio Formats VLC Supports and Its Conversion FeaturesAudio Formats Foobar2000 SupportsAudio Formats Plex Media Server SupportsKodi Audio Format: What Works & What Needs ConversionAudio Formats for PS4 and PS5 USB PlaybackAudio Formats for Xbox USB PlaybackAudio on Nintendo Switch: Limitations and WorkaroundsHow to Play FLAC on iPhone (iOS 11 and Later)How to Play FLAC on Android NativelyWAV to FLAC: Converting Without Any Quality LossAIFF to WAV: macOS to Windows Audio WorkflowM4A to OGG: Converting Apple Audio to Open-SourceOpus Bitrate Guide: 32, 64, 96, 128, 192 kbps ExplainedReduce Audio File Size Without Losing QualityAudio Format Support on Raspberry Pi with mpd and mopidyBest Audio Format in 2025: The Definitive GuideIs yt-dlp Legal? What You Need to KnowLegal Ways to Download Music for Offline ListeningCreative Commons Music for Content Creators: Full GuideWMA to MP3: What to Expect and How to ConvertAIFF to MP3: GarageBand Exports and Quality SettingsHow to Convert Audio on Mac: GarageBand & QuickTimeHow to Convert Audio on iPhone: Files App & BrowserHow to Batch Convert Audio Files: FFmpeg & BrowserExtract Audio from MP4 Without Software (Browser Method)How to Convert iPhone Voice Memo to MP3 (Free, No App)How to Convert Zoom Recording to MP3 (M4A or MP4 Export)How to Convert Google Meet Recording to MP3How to Extract Audio from a Zoom Webinar RecordingHow to Compress Audio in Audacity: Size & DynamicsFFmpeg Compress Audio: MP3, FLAC, Opus & AAC One-LinersCompress MP3 Without Losing Quality: What's PossibleHow to Make a Ringtone From an MP3 (iPhone & Android)How to Trim an MP3 Without Losing QualityHow to Cut Audio in Audacity (2026 Step-by-Step)How to Merge Audio Files: Three Real MethodsHow to Remove Vocals From a Song (Honest 2026 Guide)How to Record Audio on Mac: 2026 GuideHow to Record Audio on Windows: 2026 GuideHow to Record Audio on iPhone: 2026 GuideHow to Edit MP3 Metadata: Tools & WorkflowsHow to Find BPM of a Song: 5 MethodsHow to Split Audio Files: 3 Methods That WorkWhat Is MP3? The Format ExplainedWhat Is WAV? Everything You Need to KnowWhat Is FLAC? The Lossless Audio FormatWhat Is OGG? The Open Container Format ExplainedWhat Is M4A? Apple's Audio Format ExplainedWhat Is AAC? Advanced Audio Coding ExplainedWhat Is AIFF? Apple's Lossless Audio FormatWhat Is WMA? Windows Media Audio ExplainedAudio Bitrate Explained: What It Means for QualitySample Rate Explained: 44.1kHz vs 48kHz vs 96kHzMP3 vs WAV: Which Format Should You Use?MP3 vs FLAC: Lossy vs Lossless ComparedMP3 vs AAC: Which Codec Sounds Better?MP3 vs OGG (Vorbis): The Complete ComparisonFLAC vs WAV: Lossless Formats ComparedM4A vs MP3: Which Should You Choose?Lossless vs Lossy Audio: The Complete GuideBest Audio Format for Music ProductionBest Audio Format for PodcastsBest Audio Format for GamingBest Audio Format for Music StreamingBest Audio Format for Archiving MusicWhy WAV Files Are So Large (And What to Do About It)MP3 vs WAV for Audio Editing in a DAWWhen Should You Convert MP3 to WAV?Convert WAV to MP3 for Sharing and EmailM4A vs MP3: Which Has Better Quality and Smaller Size?What Is M4A? The iPhone Audio Format ExplainedHow to Convert MP3 to OGG for Unity Game DevelopmentOGG vs MP3 for Web Audio: Which Should You Use?WAV vs AIFF: Which Uncompressed Format?AAC vs OGG: Which Lossy Codec Wins?Opus vs MP3: The Modern Codec ShowdownM4A vs AAC: What's the Difference?What Is Opus? The Modern Audio Codec ExplainedMP3 vs WMA: Which Format Should You Choose?AAC vs FLAC: Lossy or Lossless — Which to Choose?OGG vs Opus: What's the Difference?Best Audio Format for Discord in 2026Best Audio Format for Video EditingAudio File Size Comparison: MP3, WAV, FLAC, OGG, AACOpus Audio for Web Developers: A Practical GuidePrivacy-First Audio Conversion: Why Browser-Based MattersAudacity vs AudioUtils: Which Should You Use?AIFF vs FLAC: Which Lossless Format Is Better?WMA vs MP3: Which Sounds Better?OGG vs AAC: Which Audio Codec Is Better?M4A vs OGG: Which Lossy Audio Codec to UseBest Audio Format for Zoom RecordingsBest Audio Format to Use in AudacityBest Audio Format for Voice RecordingWhat Is Vorbis? The Open Audio Codec ExplainedWhat Is ALAC? Apple Lossless Audio ExplainedGarageBand Audio Formats: What to Use and WhyiTunes and Apple Music Audio Formats ExplainedAudio Sample Rates: 44.1, 48, 96 kHz ExplainedWhat Is HLS Audio? HTTP Live Streaming ExplainedAIFF vs. AIF: What Is the Difference?Best Audio Format for iMovie: Import and Export GuideAdobe Premiere Pro Audio Format GuideLogic Pro Audio Guide: Best Import & Export SettingsOBS Studio Audio Format and Settings GuideTwitch Audio Requirements: Format, Bitrate & QualitySpotify Audio Format: What You Need to KnowYouTube Audio Requirements: Quality, Format & LUFSTikTok Audio Requirements: Format, Bitrate, and QualityAndroid Audio Formats: Native Support and Best PracticesiPhone Audio Formats: What iOS Supports & Doesn'tBest Audio Format for Ringtones: iPhone and AndroidBest Audio Format for Car USB: MP3, FLAC, or WAV?MP3 Bitrate Guide: 128 to 320 kbps ExplainedFLAC vs Opus: When to Use Each Audio CodecWAV vs MP3: The Honest Quality ComparisonAAC vs. MP3 for Streaming: Which Is Better?Best Audio Format for AudiobooksFFmpeg vs. AudioUtils: When to Use EachAudio Formats for Podcast Apps: Spotify, Apple, and MoreAudio Bitrate vs. Sample Rate: What's the Difference?Audio Transcoding vs. Converting: What Is the Difference?OGG vs FLAC: Which Should You Use?Opus vs AAC: Which Codec Is Better?WAV vs FLAC for Archiving: Which Is Best?M4A vs FLAC: Apple AAC vs Lossless Quality ComparedMP3 vs AAC for AirPods: Does the Codec Matter?Audio Normalization: Peak vs Loudness — When to Use EachAudio Quality Settings: Bitrate, Sample Rate, Bit DepthMP3 vs. WAV for Podcasting: Which Format to UseBest Audio Format for Discord: Opus, MP3, and File LimitsBest Audio Format for TikTok: Specs and Upload TipsBest Audio Format for Instagram Reels and StoriesAudio Sample Rate Explained: 44.1 vs 48 vs 96kHzFLAC vs. ALAC: Lossless Audio Format ComparisonWhat Is VBR vs CBR? Bit Allocation in Audio EncodingAudio File Too Large? How to Reduce Audio File SizeAudio Formats for Zoom: Recordings, Uploads, and SharingContainer vs Codec: The Most Confusing Thing in AudioPCM Audio Explained: Why WAV Files Are So LargeVBR vs CBR for MP3: When Each Mode Is the Right ChoiceMP3 128 kbps vs 320 kbps: Does the Difference Matter?FLAC vs WAV for Music Production: The Practical AnswerM4A vs MP3 for iPhone: Which Format to Use and WhenOGG Vorbis vs MP3: Quality, Compatibility & When OGG WinsBest Audio Format for YouTube Uploads in 2026Best Audio Format for Audacity: Import, Edit, and ExportBest Audio Format for Premiere Pro: Timelines & ExportAudio Bitrate Guide: Right Settings for Every Use CaseWhy Is My Audio File So Large? How to Reduce ItLossless Audio: Is It Worth It? The Honest AnswerMP3 File Corrupted: How to Diagnose and Fix ItAudio Format for Spotify: Upload Specs & What HappensBest Free Audio Converter: Browser-Based vs DesktopAudio Compression Explained: File Size vs Dynamic RangeID3 Tags Explained: MP3 Metadata Standard