What Is Audio Metadata?
Audio metadata is information stored inside an audio file that describes its content without being part of the audio itself. Artist name, track title, album, year, genre, and embedded album art are all metadata. Different audio formats use different metadata systems, and understanding them helps you manage libraries and ensure tags survive format conversions.
Why Audio Metadata Matters
Without metadata, a media player shows filenames instead of track names. Streaming apps cannot organize music by artist or album. Podcast players cannot display show names or episode numbers. Shuffle and sorting features break down.
Metadata is what makes an audio file useful in a library context rather than just a sequence of samples. It is what Spotify reads to display song information, what iTunes uses to organize albums, what podcast apps read for episode titles.
For professional workflows: metadata is also how delivery systems track content. Broadcast systems use embedded data for logging. Streaming platforms verify artist and album information through metadata matching. Getting metadata right before delivering audio is a professional requirement.
ID3 Tags: Metadata for MP3
ID3 tags are the metadata standard for MP3 files. ID3 stands for Identify an MP3 — the standard was created to solve MP3's original lack of built-in metadata.
Two main versions exist: ID3v1 stores fixed-length fields in the last 128 bytes of the MP3 file. It supports basic fields (title, artist, album, year, genre, comment) limited to 30 characters each. ID3v2 is more flexible, stored at the beginning of the file, and supports longer strings, Unicode text, and binary data like album artwork.
All modern software uses ID3v2. Common tags: TIT2 (title), TPE1 (artist), TALB (album), TRCK (track number), TDRC (recording year), APIC (attached picture/album art). Understanding these helps when debugging metadata issues in automated workflows.
Vorbis Comments: OGG and FLAC Metadata
OGG Vorbis and FLAC files use Vorbis Comment tags. Unlike ID3's complex binary format, Vorbis Comments are simple UTF-8 key=value pairs stored in the file header.
Common fields: TITLE, ARTIST, ALBUM, TRACKNUMBER, DATE, GENRE, COMMENT, DISCNUMBER, ALBUMARTIST.
Vorbis Comments are case-insensitive and extensible — you can define custom fields. REPLAYGAIN_TRACK_GAIN and REPLAYGAIN_ALBUM_GAIN are common non-standard fields supported by many players.
FLAC has a dedicated PICTURE metadata block for album art. OGG Vorbis embeds artwork using a base64-encoded METADATA_BLOCK_PICTURE Vorbis Comment field.
The simplicity of Vorbis Comments makes them easier to edit programmatically than ID3 tags.
M4A and AAC Metadata: iTunes Atoms
M4A and AAC files store metadata in MPEG-4 container atoms — sometimes called iTunes tags because Apple popularized the format.
Common atom tags: ©nam (title), ©ART (artist), ©alb (album), trkn (track number), ©day (year), ©gen (genre), covr (cover artwork), ©lyr (lyrics).
Atom tags support Unicode and can store binary data. Album art is stored directly in the covr atom, supporting JPEG and PNG formats.
When converting M4A to other formats using AudioUtils (via FFmpeg), these atom tags are remapped to the target format's tag system. Common fields transfer reliably. Less common fields may be dropped if there is no equivalent in the target metadata format.
Metadata and Format Conversion
Format conversion does not always preserve all metadata. FFmpeg — which AudioUtils uses — remaps common tags between formats during conversion:
Title, artist, album, track number, year, and genre almost always transfer.
Album art typically transfers between formats that support embedded artwork.
Format-specific tags (REPLAYGAIN, iTunes-specific atoms, ID3 v2.4 extended tags) may be dropped if the target format has no equivalent.
Best practice: before converting a large audio library, test a single file and verify the metadata in the output using a media player or tag editor. If important tags are missing after conversion, add them manually or use a batch tag editor after converting.
Embedded Album Art: Formats and Sizes
Album art is stored directly inside audio files as embedded binary data. Most formats support JPEG and PNG for embedded artwork.
Recommended specs for embedded artwork: JPEG format, 500x500 to 1000x1000 pixels, under 500 KB. Larger artwork increases file size noticeably and some players display only a scaled-down thumbnail anyway.
Format conventions: MP3 stores artwork in the APIC frame of ID3v2 tags. FLAC uses a dedicated PICTURE metadata block. M4A stores artwork in the covr atom. OGG encodes artwork as base64 in a METADATA_BLOCK_PICTURE field.
AudioUtils format conversions preserve embedded artwork when FFmpeg's tag mapping supports it. Verify artwork in the output file if it is important to your workflow.