SC 1.2.1 Level A WCAG 2.0

Audio-only and Video-only (Prerecorded)

Prerecorded audio needs a text transcript. Prerecorded silent video needs either a text description or an audio track that conveys the same information — so users who can't hear or can't see still get the content.

What it asks

Two single-channel media types are covered. Audio-only files (podcasts, recorded interviews, MP3 clips) must have a text transcript. Video-only files (silent product demos, animated GIFs that convey information, sign-only clips, b-roll explainers) must have either a text alternative or a synchronized audio track that conveys the same information. The point: a deaf user gets the podcast, a blind user gets the silent demo.

How to meet it

Publish a full text transcript directly below the audio player, not behind a download link only.
For silent video, write a text description that captures every action and on-screen text, placed on the same page.
Alternatively, add an audio narration track to silent video that describes what is happening as it happens.
Use machine transcription as a starting point only — Whisper / Otter output must be edited for accuracy and speaker labels.
Mark transcript headings, speaker names, and timestamps with real HTML structure (<h2>, <ol>, <time>) — not a wall of <br> tags.
For embedded social-media audio (Twitter Spaces, voice notes), provide the transcript in the surrounding page text.

Common failures

Podcast episode pages with only show notes — the conversation itself is never transcribed.
Auto-generated YouTube transcript that is 80% accurate, with proper nouns mangled and speaker boundaries missing.
Silent product GIFs with no description anywhere on the page, common on marketing pages.
Transcripts hidden behind “Click to expand” with no programmatic association to the player.
Audio interviews where one speaker is identified but the other is “Speaker 2” throughout.

Why it matters

Transcripts are also the cheapest SEO win on any media-heavy site — search engines index them, sighted users skim them, and they unlock the content for deaf users in one move. Skipping transcripts is almost always a workflow problem, not a technical one.