Audio Formats for Online Course Creators and Educators
Your students judge your course by audio quality first. Muffled, echoey, or inconsistent audio makes a course feel unprofessional regardless of the content quality. This guide covers the recording settings, file formats, and export specs that e-learning platforms expect.
Recording Settings for Voice-Over
Record your narration in WAV at 44.1 kHz, 16-bit, mono. Mono is correct for voice — learners listen on headphones, speakers, or laptop speakers, all of which handle mono correctly. Stereo voice-over wastes file size without benefit. 44.1 kHz is the standard for e-learning platforms. 24-bit gives you more editing headroom, but 16-bit is sufficient if your recording environment is quiet. Record in a dry room — a closet full of clothes, or a room treated with bookshelves and soft furnishings. Microphone placement matters more than equipment: aim for 6-8 inches from your mouth, slightly off-axis to reduce plosives.
E-Learning Platform Requirements
Udemy accepts MP3 and WAV uploads. Recommended: MP3 at 192 kbps, mono. Teachable and Thinkific accept MP4 video with embedded audio — export at AAC 192 kbps, 44.1 kHz. Coursera requires MP4 video. SCORM packages typically use MP3 for standalone audio. Articulate Storyline and Rise export audio embedded in HTML5 packages — they accept WAV and MP3 imports and compress internally. Camtasia records in TSTM format and exports to MP4 with AAC audio. Always check the current requirements on your target platform — specs change.
Processing Your Voice Recording
A basic processing chain for e-learning narration: First, noise reduction — use Audacity's built-in noise reduction, Krisp, or Adobe Podcast's AI enhancement to remove background hum and room noise. Second, EQ — a high-pass filter at 80-100 Hz removes low-frequency rumble from fans, HVAC, and desk vibration. Third, compression — a 3:1 ratio with a -20 dBFS threshold levels out volume differences between sentences. Fourth, normalization — normalize to -3 dBFS peak or -16 LUFS for consistent loudness across all your lessons. Export as WAV from your editing software, then convert to the required delivery format.
Accessibility and Captions
Audio alone is not accessible. Always provide captions or transcripts. YouTube auto-generates captions — upload your MP4 and review the auto-captions for errors. Otter.ai transcribes audio files with high accuracy. REV.com provides human transcription at $1.50 per minute. Embedding captions in your video increases engagement for all learners, not just those with hearing impairments. Students who watch in noisy environments or non-native speakers benefit significantly. Caption file formats: SRT and VTT are universally supported. Most e-learning platforms accept both.
File Organization and Version Control
Organize your course audio by module and lesson: Module01_Lesson01_Narration_v2.wav. Keep the WAV master files always. Your platform's MP3 export is for delivery — never edit from the MP3. When you re-record or fix a sentence, maintain version numbers so you know which take is current. Store masters in cloud storage with version history enabled (Google Drive or Dropbox both offer this). A 10-hour course at WAV quality is roughly 3.5 GB — manageable on any modern cloud storage plan. Backup to a local drive as well. Losing course recordings is painful and expensive to recreate.