Understanding Audio Compression and Bitrate

What Is Audio Compression?

Sound is essentially vibrations in air. To convert those vibrations into a digital file, a computer needs to "sample" the sound at extremely short intervals — recording the exact state of the sound wave at each moment. The more frequently it samples and the more precisely it records, the closer the reproduction is to the original, but the larger the file becomes.

The purpose of audio compression is to find a balance between "quality" and "file size." Imagine painting a picture: an uncompressed WAV is like a hyper-detailed painting with every pixel fully rendered; an MP3 is like a painting that preserves all the important details while omitting the subtle nuances most people won't notice.

Lossy vs Lossless Compression

Audio compression comes in two fundamentally different types:

Lossy Compression

Lossy compression permanently deletes some audio data to shrink the file. The data removed is typically detail that human ears don't easily perceive — ultra-high frequencies, sounds masked by louder ones, and so on. The advantage is that files can become extremely small.

Common lossy formats: MP3, AAC, OGG Vorbis, WMA.

Crucially, lossy compression is irreversible. Once compressed, the discarded data is gone forever. Even converting an MP3 back to WAV won't restore the quality.

Lossless Compression

Lossless compression shrinks files through smarter data arrangement without discarding anything. The decompressed audio is bit-for-bit identical to the original. The principle is similar to ZIP compression — compress it, unzip it, the file is exactly the same as before.

Common lossless formats: FLAC, ALAC (Apple Lossless), APE.

Lossless compression achieves less dramatic size reduction than lossy compression, typically shrinking files to 50–70% of the original. The benefit is zero quality loss.

Bitrate Explained

Bitrate is the amount of audio data per unit of time, measured in kbps (kilobits per second). Think of it like a water pipe — the wider the pipe (higher bitrate), the more water (audio detail) can flow through.

Audio Quality at Different Bitrates

BitrateQuality DescriptionBest Use Case
64 kbpsNoticeable distortion, muffled soundVoice memos, low-quality previews
96 kbpsAcceptable, limited detailVoice call quality
128 kbpsUsable, adequate for casual listeningBasic streaming quality
192 kbpsGood — most listeners are satisfiedEveryday music listening
256 kbpsExcellent, near-original qualityiTunes Store default quality
320 kbpsOutstanding — the best MP3 qualityHigh-quality collections

Bitrate and File Size

Bitrate and file size are proportional. A simple formula:

File size (MB) = Bitrate (kbps) × Duration (seconds) ÷ 8 ÷ 1024

Example: 128kbps × 240 seconds (4 minutes) ÷ 8 ÷ 1024 ≈ 3.75 MB

Most people can't distinguish 192kbps and above from lossless when listening normally. If you're not using professional audio equipment, 256kbps MP3 is an excellent choice.

Sample Rate Explained

Sample rate is how many times per second the sound wave is "photographed," measured in Hz (Hertz). According to the Nyquist theorem, the sample rate must be at least twice the highest frequency to accurately reproduce sound.

Common Sample Rates

Is Higher Always Better?

Not necessarily. Human hearing tops out at around 20,000 Hz (and decreases with age), so a 44,100 Hz sample rate can theoretically capture every frequency the human ear can hear. Higher sample rates are mainly used in professional settings — during recording and mixing, ultra-high sample rates can prevent distortion introduced by digital processing.

Bit Depth Explained

If sample rate determines "how often you photograph," bit depth determines "the resolution of each photo." Higher bit depth means each sample captures volume information with finer precision.

Common Bit Depths

What Is Dynamic Range?

Dynamic range is the gap between the loudest and quietest sounds. The 96 dB dynamic range of 16-bit audio means it can simultaneously record a very delicate piano pianissimo and a thundering rock concert. For everyday music listening, this is more than sufficient.

CBR vs VBR: Constant vs Variable Bitrate

When compressing audio, there's another important choice: Constant Bitrate (CBR) or Variable Bitrate (VBR).

CBR (Constant Bitrate)

The entire audio file uses the same bitrate throughout. Whether a section is quiet or complex, the same amount of data is used to represent it.

VBR (Variable Bitrate)

The bitrate adjusts dynamically based on the complexity of the audio. Simple sections (like silence or solo voice) use a lower bitrate; complex sections (like a symphonic climax) use a higher bitrate.

Recommendation: Unless you have a specific need (strict file size limits or compatibility with old devices), VBR generally gives better results.

Recommended Settings by Use Case

Use CaseFormatBitrateSample Rate
Everyday listeningMP3256–320 kbps44,100 Hz
Podcast publishingMP396–128 kbps (mono)44,100 Hz
RingtonesMP3 / M4R192 kbps44,100 Hz
Video background musicWAV / AACUncompressed / 256 kbps48,000 Hz
Studio recordingWAVUncompressed48,000–96,000 Hz
Music archiveFLACLosslessOriginal sample rate
Web audio effectsMP3 / OGG128 kbps44,100 Hz
Try MP3 Cutter Tool