Generates transcripts and captions for the input audio or video in the source language or in a target language.
Optional
Generates transcripts and captions for the input audio or video in the source language or in a target language.