AudioUtils

PCM Audio Explained: Why WAV Files Are So Large

PCM (Pulse Code Modulation) is raw digital audio — no compression, every sample stored exactly. Learn bit depth, sample rate, and why DAWs prefer PCM.

What PCM Audio Actually Is

PCM stands for Pulse-Code Modulation. It is the method by which a continuous analog sound wave is turned into a stream of numbers a computer can store, and it is the foundation of essentially all digital audio you have ever heard. CDs are PCM. WAV files are PCM. The audio your DAW processes internally is PCM. Bluetooth, when it is not using a separate codec like aptX or LDAC, ultimately decodes back to PCM. The microphone in your laptop converts analog voltage into PCM samples before the operating system ever touches the data.

PCM is not a compression scheme and not a file format. It is a representation. The samples produced by PCM can be wrapped in any container — WAV, AIFF, BWF, Matroska, MP4 — or written to a CD as raw audio with no container at all. Whenever you read about "uncompressed audio," "raw audio," or "lossless PCM," you are reading about the same underlying thing.

How Sampling Turns Sound Into Numbers

Sound is a pressure wave. A microphone converts that pressure wave into a continuously varying voltage. A digital audio system has to turn that continuous voltage into a finite list of numbers, and PCM does this with two simple operations performed many thousands of times per second:

1. Measure the voltage at a precise instant. This is called sampling. 2. Round that voltage to the nearest available numerical value. This is called quantization.

That is the entire idea. Repeat it 44,100 times every second and you have a CD. The two parameters you can vary are how often you sample and how many distinct values you can round to.

Sample Rate

Sample rate is the number of samples taken per second, expressed in hertz (Hz) or kilohertz (kHz). The Nyquist-Shannon sampling theorem proves that to faithfully capture a signal containing frequencies up to F hertz, you need to sample at least 2F times per second. Human hearing tops out around 20 kHz, so a sample rate of at least 40 kHz is required to capture everything the ear can hear.

Common sample rates and where you encounter them:

  • 8 kHz — landline telephony and old voicemail. Captures up to 4 kHz, fine for intelligible speech and useless for music.
  • 16 kHz — wideband VoIP (Microsoft Teams, Zoom in HD mode, modern smartphone codecs).
  • 22.05 kHz — half of CD rate. Used in old multimedia and some game audio engines.
  • 32 kHz — broadcast radio and miniDV camcorders.
  • 44.1 kHz — Audio CD, the de facto standard for consumer music files.
  • 48 kHz — the universal standard for video and broadcast. YouTube, every major NLE (Premiere, Resolve, Final Cut), every camera, and every video game uses 48 kHz internally.
  • 88.2 / 96 kHz — high-resolution music and professional recording. Useful headroom for pitch shifting and time stretching.
  • 176.4 / 192 kHz — high-end mastering and audiophile distribution. Most engineers consider anything above 96 kHz to be a marketing exercise rather than an audible improvement.

The 44.1 kHz figure is not arbitrary. It was chosen in the late 1970s because early digital recorders were built on top of video tape transports, and 44.1 kHz fits cleanly into both NTSC and PAL video frame rates.

Bit Depth

Bit depth is the number of bits used to represent each sample. It controls how many distinct amplitude values you can quantize to, and therefore how small a level change you can represent before rounding kicks in.

  • 8-bit — 256 possible values per sample. Audible quantization noise. Used historically in early computers and still seen in retro game audio.
  • 16-bit — 65,536 values per sample. Roughly 96 dB of dynamic range. The CD standard, and inaudibly clean for any final listening scenario.
  • 24-bit — 16,777,216 values per sample. Roughly 144 dB of dynamic range. The recording and mixing standard. The extra headroom is not for listening — it is for the dozens of times the signal will be scaled, summed and processed inside a mix.
  • 32-bit float — uses IEEE 754 floating-point numbers instead of integers. The dynamic range is effectively unlimited and clipping inside the mix bus becomes mathematically reversible. Used internally by every modern DAW and increasingly as a recording format on devices like the Zoom F3 and Sound Devices MixPre.

A common misconception is that higher bit depth makes recordings sound "more detailed." It does not — it lowers the noise floor. For final listening, 16-bit is transparent. For a recording you intend to process, 24-bit (or 32-bit float) protects you from the cumulative quantization error of a long signal chain.

How PCM Differs From MP3, AAC and Other Lossy Codecs

PCM stores every sample. MP3, AAC, Opus and Vorbis throw most of them away.

A lossy codec uses a psychoacoustic model — a body of research about which sounds the human ear cannot perceive when other sounds are present — to discard whatever the model predicts you will not miss. A 320 kbps MP3 is about a quarter of the size of the equivalent CD-quality PCM, and a 96 kbps Opus stream is about a sixteenth. The audio that comes out of the decoder is not the audio that went in. It is an approximation that fits within the bit budget you specified.

This matters in three concrete ways for anyone working with audio:

1. Compounding loss. Each time you re-encode a lossy file you lose more information. PCM has nothing to lose — copy it a thousand times and the thousandth copy is bit-identical to the first. 2. Editing precision. Lossy codecs work on frames (typically 20-50 ms blocks). Sample-accurate edits require decoding to PCM, editing, and re-encoding, which is itself a generation loss. 3. Effects processing. Every plugin and every DAW operates on PCM internally. An MP3 dropped onto a track is decoded to PCM the moment it loads.

If you want to read more about the difference, see lossless vs lossy audio and the WAV format explainer.

Why WAV Files Are So Large: The File Size Math

The math behind PCM file size is exact. Multiply sample rate, bit depth, channels and duration:

bytes = sample_rate × (bit_depth / 8) × channels × seconds

A four-minute stereo song at CD quality (44.1 kHz, 16-bit, 2 channels):

  • 44,100 × 2 × 2 × 240 = 42,336,000 bytes ≈ 40.4 MB

Bumping to 48 kHz / 24-bit, the same four minutes becomes 66 MB. At 96 kHz / 24-bit it is 132 MB. A one-hour 96 kHz / 24-bit stereo recording is just under 2 GB. Multitrack sessions multiply that by the number of tracks: a 32-track album recording at 96/24 will eat through 60 GB of disk for raw audio alone before any takes, comps or alternates are stored.

By comparison, the same four-minute song as a 320 kbps MP3 is about 9.6 MB and as a typical FLAC is about 22 MB. MP3 is throwing away about 77% of the bits. FLAC is losslessly removing redundancy in the PCM stream — every FLAC decoder produces bit-identical PCM output, no matter who encoded the file or which compression level was used. Read more in what is FLAC.

WAV, AIFF, BWF and the Container Question

PCM samples have to live inside something. The file you actually open is the container — a wrapper that adds a header describing how to interpret the bytes that follow.

  • WAV — Microsoft and IBM, 1991. Built on the RIFF chunk format. Universal compatibility. Native 4 GB file size limit (RF64 extends this).
  • AIFF — Apple, 1988. Built on Electronic Arts' IFF format. Functionally identical PCM data; different chunk structure. Slightly better metadata in the original spec.
  • BWF (Broadcast WAV) — EBU, 1997. WAV plus a bext chunk that stores timecode, scene/take, originator and other production metadata. The standard delivery format for film and TV post-production.
  • CAF — Apple Core Audio Format. Used by Logic and some iOS apps. No 4 GB limit.
  • W64 (Sony Wave64) — extends WAV past the 4 GB barrier for very long recordings.
  • Raw PCM — no header at all. The receiver must already know the sample rate, bit depth and channel count.

When you convert MP3 to WAV, FLAC to WAV, or WAV to MP3, the WAV side of the operation is always the PCM side. Decoders produce PCM, encoders consume PCM.

PCM Variants: LPCM, DPCM, ADPCM, μ-law and A-law

When someone says "PCM," they almost always mean Linear PCM (LPCM) — uniform quantization steps, every sample takes the same number of bits, samples are stored end to end. This is what is in WAV, AIFF and CDs.

But the umbrella covers a few variations worth knowing about:

  • DPCM (Differential PCM) stores the difference between successive samples instead of the absolute values. Because consecutive samples in real audio are usually similar, the differences fit in fewer bits.
  • ADPCM (Adaptive DPCM) extends DPCM by adapting the step size dynamically based on signal content. Used in older game audio, voice memos on some legacy devices, and the IMA ADPCM and Microsoft ADPCM variants you sometimes see inside WAV files.
  • μ-law (mu-law) is a non-linear 8-bit encoding used in North American and Japanese telephony. It allocates more resolution to quiet signals where the ear is more sensitive, giving the perceived dynamic range of roughly 14 linear bits in 8 bits per sample.
  • A-law is the European telephony equivalent, with a slightly different companding curve.

If a WAV file refuses to import into a DAW with a "format not supported" error, it is usually because the WAV contains μ-law, A-law or ADPCM rather than plain LPCM. Decoding it to a standard 16-bit LPCM WAV with our WAV converter or ffmpeg fixes the issue.

Why DAWs Decode Everything to PCM Before Processing

Open Logic, Pro Tools, Ableton, Reaper, Cubase or any other DAW, drag an MP3 onto a track, and the file gets decoded to PCM the moment the track is created. Some DAWs do this lazily and stream-decode on playback; others bake out a PCM rendered file in the project folder. Either way, no plugin is ever fed MP3 frames.

There are four practical reasons for this:

1. Plugin signal flow assumes PCM. Every VST, AU and AAX plugin operates on a buffer of float samples. There is no MP3-aware EQ. 2. Sample-accurate timing. PCM lets you cut, fade and quantize to a specific sample. MP3 frames are 1,152 samples long and you cannot edit inside one. 3. Mathematical reversibility. Many operations (gain, panning, summing) are loss-free in float PCM and lossy in any compressed format. 4. Predictable CPU. PCM playback is essentially memory-bound. Decoding 80 MP3 tracks in parallel would dwarf the cost of the actual mixing.

This is why "best practice" advice for music production and Audacity is always: keep your working files in WAV (or FLAC if disk space is tight), and only export to MP3, AAC or Opus at the very end of the chain.

When PCM Is the Right Choice — And When It Is Not

Use PCM when:

  • Recording. Capture in 24-bit at the project's sample rate (48 kHz for video, 44.1 or 48 kHz for music).
  • Mixing and mastering. Every working file should be PCM. Bouncing between FLAC and PCM is fine; bouncing through MP3 is not.
  • Delivery to clients, broadcasters, mastering engineers and aggregators. They expect WAV or BWF.
  • Archiving project sessions. PCM in a 24-bit WAV will still open in 50 years.

Use a compressed format when:

  • Distributing to listeners. Convert your PCM master to AAC, MP3 or Opus. A typical 256 kbps AAC distributed via Spotify or Apple Music is indistinguishable from PCM for the overwhelming majority of listeners.
  • Embedding in video for the web. Use AAC inside MP4 or Opus inside WebM.
  • Storing voice memos and meeting recordings, where size matters far more than fidelity.

PCM is the correct internal representation. It is rarely the correct distribution format.

More to Read

What Is MP3? The Format ExplainedWhat Is WAV? Everything You Need to KnowWhat Is FLAC? The Lossless Audio FormatWhat Is OGG? The Open Container Format ExplainedWhat Is M4A? Apple's Audio Format ExplainedWhat Is AAC? Advanced Audio Coding ExplainedWhat Is AIFF? Apple's Lossless Audio FormatWhat Is WMA? Windows Media Audio ExplainedAudio Bitrate Explained: What It Means for QualitySample Rate Explained: 44.1kHz vs 48kHz vs 96kHzWhy WAV Files Are So Large (And What to Do About It)What Is M4A? The iPhone Audio Format ExplainedWhat Is Opus? The Modern Audio Codec ExplainedAudio File Size Comparison: MP3, WAV, FLAC, OGG, AACWhat Is Vorbis? The Open Audio Codec ExplainedWhat Is ALAC? Apple Lossless Audio ExplainediTunes and Apple Music Audio Formats ExplainedWhat Is HLS Audio? HTTP Live Streaming ExplainedAIFF vs. AIF: What Is the Difference?Android Audio Formats: Native Support and Best PracticesiPhone Audio Formats: What iOS Supports & Doesn'tMP3 Bitrate Guide: 128 to 320 kbps ExplainedAudio Bitrate vs. Sample Rate: What's the Difference?Audio Transcoding vs. Converting: What Is the Difference?Audio Normalization: Peak vs Loudness — When to Use EachAudio Quality Settings: Bitrate, Sample Rate, Bit DepthAudio Sample Rate Explained: 44.1 vs 48 vs 96kHzWhat Is VBR vs CBR? Bit Allocation in Audio EncodingContainer vs Codec: The Most Confusing Thing in AudioAudio Bitrate Guide: Right Settings for Every Use CaseAudio Compression Explained: File Size vs Dynamic RangeID3 Tags Explained: MP3 Metadata StandardMP3 vs WAV: Which Format Should You Use?MP3 vs FLAC: Lossy vs Lossless ComparedMP3 vs AAC: Which Codec Sounds Better?MP3 vs OGG (Vorbis): The Complete ComparisonFLAC vs WAV: Lossless Formats ComparedM4A vs MP3: Which Should You Choose?Lossless vs Lossy Audio: The Complete GuideAudio Formats Explained: The Complete GuideHow to Convert Audio Files: Complete GuideHow to Reduce Audio File Size Without Losing QualityHow to Convert iPhone Voice Memo to MP3 FreeHow Audio Compression WorksBest Audio Format for WebsitesHow to Batch Convert Audio FilesHow to Extract Audio from Video FilesBest Audio Format for Music ProductionBest Audio Format for PodcastsBest Audio Format for GamingBest Audio Format for Music StreamingBest Audio Format for Archiving MusicDoes Converting MP3 to WAV Improve Quality?How to Convert MP3 to WAV for Music ProductionHow to Convert MP3 to WAV Without Losing QualityMP3 vs WAV for Audio Editing in a DAWWhen Should You Convert MP3 to WAV?How to Convert MP3 to WAV on Mac and WindowsHow to Convert WAV to MP3 Without Losing QualityConvert WAV to MP3 for Sharing and EmailWAV File Too Large? Convert to MP3How to Convert iPhone Voice Memo to MP3 FreeHow to Play M4A Files on Android (Convert to MP3)M4A vs MP3: Which Has Better Quality and Smaller Size?How to Convert MP3 to OGG for Unity Game DevelopmentOGG vs MP3 for Web Audio: Which Should You Use?WAV vs AIFF: Which Uncompressed Format?AAC vs OGG: Which Lossy Codec Wins?Opus vs MP3: The Modern Codec ShowdownM4A vs AAC: What's the Difference?How to Convert FLAC to MP3 Without Losing QualityBest Bitrate for FLAC to MP3 ConversionConvert AAC to MP3: Best Quality SettingsHow to Extract Audio from MP4 FilesConvert iPhone MOV Video to MP3How to Convert WAV to MP3 (The Complete Guide)How to Convert MOV to MP3 (iPhone & QuickTime)How to Convert MP3 to WAV for Editing and DAWsHow to Convert YouTube to MP3 Legally (3 Ways)Best MP3 to WAV Settings for Editing and DAWsBest WAV to MP3 Bitrate for Music, Podcasts, and VoiceMOV to MP3 on Mac: Fastest Ways ComparedHow to Convert M4A to MP3 on iPhone Without a ComputerHow to Convert FLAC to MP3 on MacHow to Convert FLAC to MP3 on WindowsHow to Convert OGG to MP3 on MacHow to Convert MP4 to MP3 on MacHow to Convert MP4 to MP3 on iPhoneHow to Convert MP4 to MP3 on AndroidHow to Convert WMA to MP3 on MacHow to Convert AIFF to MP3 on MacHow to Convert MOV to MP3 on WindowsMP3 vs WMA: Which Format Should You Choose?AAC vs FLAC: Lossy or Lossless — Which to Choose?OGG vs Opus: What's the Difference?Best Audio Format for Discord in 2026Best Audio Format for Video EditingM4A to WAV: How to Convert and WhyHow to Convert FLAC to OGG VorbisHow to Convert AAC to WAV for EditingOpus Audio for Web Developers: A Practical GuidePrivacy-First Audio Conversion: Why Browser-Based MattersHow to Convert WMA to MP3 on WindowsHow to Convert AIFF to MP3 on WindowsHow to Convert OGG to MP3 on WindowsHow to Convert FLAC to MP3 on iPhoneHow to Convert AAC to MP3 on MacHow to Convert M4A to MP3 on Mac: 3 Easy MethodsHow to Convert Audio Files with AudacityHow to Convert Audio Files with VLCAudacity vs AudioUtils: Which Should You Use?AIFF vs FLAC: Which Lossless Format Is Better?WMA vs MP3: Which Sounds Better?OGG vs AAC: Which Audio Codec Is Better?M4A vs OGG: Which Lossy Audio Codec to UseBest Audio Format for Zoom RecordingsBest Audio Format to Use in AudacityBest Audio Format for Voice RecordingGarageBand Audio Formats: What to Use and WhyAudio Sample Rates: 44.1, 48, 96 kHz ExplainedFLAC to AAC: Bitrate Guide and Practical StepsOGG to AAC: Cross-Platform Audio Migration GuideWMA to OGG: Escape the Windows Media EcosystemWMA to FLAC: Lossless Archiving of Your Old WMA LibraryFLAC to Opus: Web Streaming Optimization GuideAIFF to M4A: Apple Production Workflow GuideWAV to AIFF: Windows to Mac Audio WorkflowBest Audio Format for iMovie: Import and Export GuideAdobe Premiere Pro Audio Format GuideLogic Pro Audio Guide: Best Import & Export SettingsOBS Studio Audio Format and Settings GuideTwitch Audio Requirements: Format, Bitrate & QualitySpotify Audio Format: What You Need to KnowYouTube Audio Requirements: Quality, Format & LUFSTikTok Audio Requirements: Format, Bitrate, and QualityBest Audio Format for Ringtones: iPhone and AndroidBest Audio Format for Car USB: MP3, FLAC, or WAV?How to Convert AAC to MP3 on iPhoneHow to Convert FLAC to MP3 on AndroidHow to Convert OGG to MP3 on AndroidHow to Convert WAV to MP3 on iPhoneHow to Convert AIFF to MP3 on iPhoneHow to Convert M4A to MP3 on WindowsOpus to MP3: Complete Conversion GuideFLAC vs Opus: When to Use Each Audio CodecWAV vs MP3: The Honest Quality ComparisonAAC vs. MP3 for Streaming: Which Is Better?Best Audio Format for AudiobooksConvert Audio on Linux: Command Line and Browser OptionsFFmpeg vs. AudioUtils: When to Use EachAudio Formats for Podcast Apps: Spotify, Apple, and MoreHow to Convert Audio Without Installing SoftwareHow to Convert WMA to MP3 on Mac (Step-by-Step Guide)OGG to FLAC: What to Expect from the ConversionAAC to FLAC: Convert and What to ExpectOpus to WAV: How to Convert and Why You Might Need ToWAV to Opus: The Web Developer's Audio GuideOGG vs FLAC: Which Should You Use?Opus vs AAC: Which Codec Is Better?WAV vs FLAC for Archiving: Which Is Best?M4A vs FLAC: Apple AAC vs Lossless Quality ComparedBest Audio Format for Speech-to-Text TranscriptionBest Audio Format for WhatsApp Voice MessagesAudio Formats Windows Media Player Plays NativelyAudio Formats VLC Supports and Its Conversion FeaturesAudio Formats Foobar2000 SupportsAudio Formats Plex Media Server SupportsKodi Audio Format: What Works & What Needs ConversionAudio Formats for PS4 and PS5 USB PlaybackAudio Formats for Xbox USB PlaybackAudio on Nintendo Switch: Limitations and WorkaroundsMP3 vs AAC for AirPods: Does the Codec Matter?How to Play FLAC on iPhone (iOS 11 and Later)How to Play FLAC on Android NativelyWAV to FLAC: Converting Without Any Quality LossAIFF to WAV: macOS to Windows Audio WorkflowM4A to OGG: Converting Apple Audio to Open-SourceOpus Bitrate Guide: 32, 64, 96, 128, 192 kbps ExplainedReduce Audio File Size Without Losing QualityAudio Format Support on Raspberry Pi with mpd and mopidyBest Audio Format in 2025: The Definitive GuideIs yt-dlp Legal? What You Need to KnowLegal Ways to Download Music for Offline ListeningCreative Commons Music for Content Creators: Full GuideWMA to MP3: What to Expect and How to ConvertAIFF to MP3: GarageBand Exports and Quality SettingsMP3 vs. WAV for Podcasting: Which Format to UseBest Audio Format for Discord: Opus, MP3, and File LimitsBest Audio Format for TikTok: Specs and Upload TipsBest Audio Format for Instagram Reels and StoriesHow to Convert Audio on Mac: GarageBand & QuickTimeHow to Convert Audio on iPhone: Files App & BrowserHow to Batch Convert Audio Files: FFmpeg & BrowserFLAC vs. ALAC: Lossless Audio Format ComparisonAudio File Too Large? How to Reduce Audio File SizeAudio Formats for Zoom: Recordings, Uploads, and SharingExtract Audio from MP4 Without Software (Browser Method)VBR vs CBR for MP3: When Each Mode Is the Right ChoiceMP3 128 kbps vs 320 kbps: Does the Difference Matter?M4A vs MP3 for iPhone: Which Format to Use and WhenOGG Vorbis vs MP3: Quality, Compatibility & When OGG WinsBest Audio Format for YouTube Uploads in 2026Best Audio Format for Premiere Pro: Timelines & ExportHow to Convert iPhone Voice Memo to MP3 (Free, No App)How to Convert Zoom Recording to MP3 (M4A or MP4 Export)How to Convert Google Meet Recording to MP3Why Is My Audio File So Large? How to Reduce ItLossless Audio: Is It Worth It? The Honest AnswerHow to Extract Audio from a Zoom Webinar RecordingMP3 File Corrupted: How to Diagnose and Fix ItAudio Format for Spotify: Upload Specs & What HappensBest Free Audio Converter: Browser-Based vs DesktopHow to Compress Audio in Audacity: Size & DynamicsFFmpeg Compress Audio: MP3, FLAC, Opus & AAC One-LinersCompress MP3 Without Losing Quality: What's PossibleHow to Make a Ringtone From an MP3 (iPhone & Android)How to Trim an MP3 Without Losing QualityHow to Cut Audio in Audacity (2026 Step-by-Step)How to Merge Audio Files: Three Real MethodsHow to Remove Vocals From a Song (Honest 2026 Guide)How to Record Audio on Mac: 2026 GuideHow to Record Audio on Windows: 2026 GuideHow to Record Audio on iPhone: 2026 GuideHow to Edit MP3 Metadata: Tools & WorkflowsHow to Find BPM of a Song: 5 MethodsHow to Split Audio Files: 3 Methods That Work