Create transcription
Transcribes audio into text. Accepts base64-encoded audio input and returns the transcribed text.
Authorizations
API key as bearer token in Authorization header
Body
Speech-to-text request input. Accepts a JSON body with input_audio containing base64-encoded audio.
Base64-encoded audio to transcribe
{ "data": "UklGRiQA...", "format": "wav" }STT model identifier
"openai/whisper-large-v3"
ISO-639-1 language code (e.g., "en", "ja"). Auto-detected if omitted.
"en"
Provider-specific passthrough configuration
Sampling temperature for transcription
0
Response
Transcription result
STT response containing transcribed text and optional usage statistics
The transcribed text
"Hello, this is a test of OpenAI speech-to-text transcription. The weather is sunny today and the temperature is around 72 degrees."
Aggregated usage statistics for the request
{
"cost": 0.000508,
"input_tokens": 83,
"output_tokens": 30,
"seconds": 9.2,
"total_tokens": 113
}