Audio to Text
API reference for the audio-to-text pipeline. Accepts audio input and returns a transcript using Whisper-compatible models.
Documentation Index
Fetch the complete documentation index at: https://na-36-merge-docs-v2-dev-draft-into-docs-v2-clean-20260525.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Uploaded audio file to be transcribed.
Hugging Face model ID used for transcription.
Return timestamps for the transcribed text. Supported values: 'sentence', 'word', or a string boolean ('true' or 'false'). Default is 'true' ('sentence'). 'false' means no timestamps. 'word' means word-based timestamps.
Additional job information to be passed to the pipeline.