Microsoft: MAI-Transcribe 1.5

1

Get your API key

Create an API key from your OpenRouter dashboard and set it as an environment variable:

2

Make your first request

Use microsoft/mai-transcribe-1.5 with the OpenRouter API:

OpenRouter provides a speech-to-text API that transcribes audio into text. Send base64-encoded audio with a model, and receive the transcribed text in JSON.

The generation ID is returned in the X-Generation-Id response header for tracking.

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

Parameters

Name	Type	Default	Description
`max_tokens`	integer	—	This sets the upper limit for the number of tokens the model can generate in response.
`temperature`	float	`1`	This setting influences the variety in the model's responses.
`top_p`	float	`1`	This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P.
`max_completion_tokens`	integer	—	This sets the upper limit for the number of tokens the model can generate in response.