AudioUtils
Workflow Guide

Audio for E-Learning

E-learning audio quality directly affects learner engagement and comprehension. Poor audio — muffled voiceover, background noise, inconsistent volume — causes learners to disengage. This guide covers the formats, settings, and production practices that produce clear, professional audio for online courses and training content.

Format Requirements for LMS Platforms

Learning management systems (LMS) like Moodle, Canvas, Blackboard, and Articulate SCORM packages generally support MP3 and AAC inside SCORM, xAPI, and HTML5 content. MP3 at 128 kbps mono is standard for voiceover-only audio. For audio with music, 192 kbps stereo provides a good balance of quality and file size. AAC inside an M4A container is increasingly accepted by modern LMS platforms and offers slightly better quality than MP3 at the same bitrate. Avoid WAV for delivered e-learning content — file sizes are impractical for web delivery. Keep audio in WAV or FLAC during production, then export to MP3 or AAC for delivery.

Voiceover Recording Best Practices

Record voiceover in a quiet space — a small room with soft furnishings reduces echo. Use a cardioid condenser or dynamic microphone (USB mics like the Blue Yeti or Rode NT-USB are popular choices for e-learning production). Record at 44.1 kHz, 24-bit. Keep the microphone 6-10 cm from your mouth, slightly off-axis to reduce plosive sounds (P, B, T). Use a pop filter. Normalize audio after recording to -16 to -12 LUFS to ensure consistent playback volume across a course. Apply gentle high-pass filtering (roll off below 80 Hz) to remove room rumble. A touch of compression (2:1 ratio, slow attack, fast release) evens out dynamic fluctuations between sentences.

Accessibility Considerations

Audio-only e-learning content must be accompanied by transcripts for accessibility compliance with WCAG 2.1 and Section 508. Ensure voiceover speech is clear, measured, and at an appropriate pace — the optimal narration speed for learning content is 150-160 words per minute, slightly slower than conversational speech. Background music (if used) should be 15-20 dB quieter than the voiceover — background music that competes with narration is a common barrier for learners with hearing impairments or auditory processing differences. Test your audio with a screen reader to ensure media player controls are accessible.

File Size Optimization

E-learning files need to load quickly, especially for learners on mobile connections. Voiceover audio converted to mono immediately halves the file size compared to stereo, with no perceptible quality loss for speech content. 128 kbps MP3 mono produces approximately 1 MB per minute — a 10-minute lesson has a 10 MB audio track, very manageable. Avoid embedding uncompressed audio in course packages. Use HTML5 audio elements with MP3 and AAC sources (with MP3 as fallback) for maximum browser compatibility. Lazy-load audio assets that are not in the current slide or scene to reduce initial load time.

Handling Music and Sound Effects

Use royalty-free music from sources like Pixabay, Incompetech, or subscription services like Epidemic Sound for background tracks. Export music beds as MP3 at 192 kbps stereo. Sound effects (notifications, button clicks, transitions) can be kept as short WAV files for low latency, or MP3 for file size savings. Avoid looping music that has an obvious break point — use tracks with natural loop points or crossfade the loop in your authoring tool. Ensure all music and sound effects are properly licensed for commercial e-learning use — stock libraries often have educational vs commercial licensing distinctions.