Text representing spoken words as well as sound effects, speaker names, and non-speech details for accessibility.
Technical Specifications & Context
Captions differ from traditional translations (subtitles) as they are designed specifically for deaf or hard-of-hearing audiences, indexing all audio elements (e.g. [door creaks], [music playing]).
Syntax Format Sample
00:00:12,000 --> 00:00:15,000
[Speaker A]: (whispering) Listen to this.
Ready to Align Your Media Timelines?
Launch our in-browser digital audio workstation. Stamp timestamps, adjust delay offsets, and export SRT, WebVTT, and LRC offline.