Why Is My Audio File So Large? (And How to Make It Smaller)
WAV files are large because they store uncompressed PCM. Learn the math behind audio file sizes, when to compress, and which format reduces size without quality loss.
Audio files balloon for predictable mathematical reasons. A 5-minute song that arrives as a 3 MB MP3 takes up 50 MB as a CD-quality WAV — sixteen times larger — because uncompressed PCM stores every single sample as a number while MP3 throws away most of them. Once you understand the four variables that govern file size (format, sample rate, bit depth, channels), it becomes obvious why a Zoom recording is small but your studio session is huge, and exactly how to shrink either one.
The Four Variables
Every audio file's size traces back to four numbers:
1. Whether it is compressed. Uncompressed PCM (WAV, AIFF) stores every sample. Lossless compression (FLAC, ALAC) packs them more efficiently. Lossy compression (MP3, AAC, Opus) discards inaudible content. 2. Sample rate. Samples per second of audio. CD-standard is 44,100. Video uses 48,000. Hi-res can reach 192,000. 3. Bit depth. Bits per sample. CD is 16-bit. Studio masters are usually 24-bit. 32-bit float is used during processing. 4. Channels. Mono is 1, stereo is 2, 5.1 surround is 6, Dolby Atmos can be 12 or more.
Multiply all four and you have the bitrate of uncompressed PCM. Add duration to get total file size.
Why Audio Files Can Be Surprisingly Large
The size of an audio file depends on two things: whether it uses compression, and if so, how much. Understanding the math makes file size predictable — and shows you exactly how much you can reduce it.
Uncompressed Audio: The Size Math
WAV and AIFF files store raw PCM audio with no compression. The file size formula:
File size (bytes) = sample rate x bit depth / 8 x channels x duration (seconds)
For a 1-hour stereo WAV at 44.1 kHz, 16-bit:
For a 1-hour mono WAV at the same specs:
This is why a 1-hour Zoom recording saved as WAV is 300-600 MB, while the M4A from the same call is under 100 MB.
Compressed Formats: How Much Smaller?
| Format | 1-hour stereo | Compression ratio vs WAV | |--------|--------------|--------------------------| | WAV 44.1 kHz 16-bit | ~600 MB | 1x (baseline) | | FLAC (lossless) | ~300 MB | ~2x smaller | | MP3 320 kbps | ~140 MB | ~4x smaller | | MP3 192 kbps | ~85 MB | ~7x smaller | | MP3 128 kbps | ~55 MB | ~11x smaller | | AAC 256 kbps | ~115 MB | ~5x smaller | | AAC 128 kbps | ~55 MB | ~11x smaller | | Opus 64 kbps | ~28 MB | ~21x smaller |
When to Compress: Quality vs Size Tradeoffs
Lossless compression (FLAC): about 50% size reduction with zero quality loss. Use FLAC whenever you need lossless audio but want to save storage.
Lossy compression (MP3, AAC, Opus): 75-95% size reduction with some quality loss. Use lossy compression for distribution, sharing, and any use case where the audio does not need to be re-edited.
The key rule: never compress a file you might edit later. Keep WAV or FLAC as your master. Create compressed versions for distribution.
Reducing Size Without Quality Loss
1. WAV to FLAC: reduces size by ~50% with zero quality loss. The best option for lossless archives. 2. Stereo to mono: if the audio is a voice recording with no stereo information, converting to mono cuts file size exactly in half. 3. Sample rate reduction: 48 kHz to 44.1 kHz reduces size by ~8%. Only do this if 44.1 kHz is sufficient for your use case.
Reducing Size with Some Quality Loss
For sharing and distribution, lossy compression is appropriate:
Why Stereo Doubles File Size
Stereo files are exactly twice the size of mono at the same sample rate and bit depth. Every sample timestamp now has two numbers (left channel and right channel) instead of one. For voice recordings — where the same audio comes out of both speakers — this is wasteful. A podcast voiceover at 16-bit/44.1 kHz mono is 5.3 MB/min; the same audio in stereo is 10.6 MB/min and contains zero additional information.
Convert spoken-word stereo files to mono and you halve the file size with no audible loss. For music, stereo is meaningful and should stay stereo.
Why Bit Depth Doubling Matters
A 24-bit file is 50% larger than a 16-bit file at the same sample rate. The 8 extra bits per sample give 144 dB of dynamic range instead of 96 dB — useful during recording and mixing where you want headroom for editing, but irrelevant for finished playback because no consumer environment exceeds 70 dB of dynamic range. Mastering for distribution at 24-bit and only dithering down to 16-bit at the very end is the standard workflow.
For archival of finished material, 16-bit/44.1 kHz is the right ceiling unless you specifically need the headroom for future processing.
Why Sample Rate Multiplies
Doubling sample rate doubles file size because there are twice as many samples per second. A 96 kHz file is twice the size of a 48 kHz file. A 192 kHz file is four times. The audible benefit above 48 kHz is debated and well-tested as inaudible for most listeners on most material — see audio sample rate explained for the deep dive. For finished content destined for headphones or speakers, 44.1 or 48 kHz is the right ceiling.
Multichannel Multiplication
Stereo doubles mono size. 5.1 surround quadruples it (6 channels of audio at the same sample rate and bit depth). Dolby Atmos object-based audio with 12 channels makes a single hour of master content multiple GB. This is why home cinema masters arrive on hard drives instead of optical media, and why streaming services compress aggressively before delivery.
The Practical Approach
Keep your WAV master. Create compressed versions for specific uses. A 600 MB WAV archive becomes a 55 MB MP3 for sharing — decide what quality level the sharing use case requires, then compress to that target. For voice content, mono MP3 at 64–128 kbps is plenty. For music distribution, MP3 V2 VBR or AAC at 256 kbps approaches transparency. For lossless archive that still saves space, FLAC at compression level 5 is the sweet spot. The browser-based audio compressor handles any of these targets without uploading your file.
For deeper context on PCM and how the math works, see PCM audio explained and what is PCM. For target bitrate selection, see audio bitrate guide by use case.