Speech / Audio Annotator (ASR / Voice Labels)

Annotate audio for ASR/voice tasks: transcripts, speaker turns, timestamps, emotion/intention tags, audio quality labels and voice characteristics. Ideal for voice datasets and conversational AI.



Responsibilities


Core responsibilities



  • Create highly accurate verbatim or corrected transcripts (per client spec).

  • Insert timestamps, speaker diarization and metadata (accent, noise, call quality).

  • Label conversation acts (intent, interruption, silence).

  • Validate automated ASR outputs and correct where needed.

  • Provide short audio samples for ambiguous cases.



Qualifications


Skills & experience



  • Must: excellent listening skills, strong grammar and typing accuracy.

  • Preferable: experience in ASR or speech labeling tools, exposure to multi-accent audio.

  • Nice-to-have: linguistics background, bilingual ability for localization tasks.


Back to blog