Microsoft: MAI-Transcribe 1.5

microsoft/mai-transcribe-1.5

MAI-Transcribe 1.5 is Microsoft's fast transcription model, powered by Azure AI Speech. It supports 100+ BCP-47 locales with automatic language detection and automatic punctuation, and it is billed per second of audio duration. It uses the Azure Speech fast transcription API (v2025-10-15).

Price: $0.36/hour

Released: Jun 2, 2026
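To make the listed price concrete, here is a rough cost estimate in Python. It assumes billing is strictly linear in seconds of audio, with no minimum charge or rounding; the page does not confirm those details, and the function name is hypothetical.

```python
from decimal import Decimal

HOURLY_RATE_USD = Decimal("0.36")  # listed price per hour of audio
SECONDS_PER_HOUR = Decimal(3600)


def estimate_cost_usd(duration_seconds: int) -> Decimal:
    """Estimate the cost of transcribing `duration_seconds` of audio.

    ASSUMPTION: linear per-second billing, no minimum charge, no rounding.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return HOURLY_RATE_USD * Decimal(duration_seconds) / SECONDS_PER_HOUR
```

Under these assumptions the per-second rate is $0.36 / 3600 = $0.0001, so a 90-second clip would cost about $0.009.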


Sample code and API for MAI-Transcribe 1.5

OpenRouter normalizes requests and responses across providers for you.

1. Get your API key

Create an API key from your OpenRouter dashboard and set it as the OPENROUTER_API_KEY environment variable.
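Once the variable is set, a script can read it back rather than hard-coding the key. A minimal sketch in Python, where the helper name `load_api_key` is hypothetical and the variable name follows the `$OPENROUTER_API_KEY` placeholder used in the Authorization header on this page:

```python
import os


def load_api_key(env_var: str = "OPENROUTER_API_KEY") -> str:
    """Return the API key stored in the given environment variable."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        # Fail early with a clear message instead of sending an empty token.
        raise RuntimeError(f"{env_var} is not set; export it before running.")
    return key
```

Failing fast here avoids a less obvious authentication error later in the request.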

2. Make your first request

Use microsoft/mai-transcribe-1.5 with the OpenRouter API.

OpenRouter provides a speech-to-text API that transcribes audio into text. Send base64-encoded audio with a model, and receive the transcribed text in JSON.

The generation ID is returned in the X-Generation-Id response header for tracking.
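The flow described above (base64-encode the audio, POST it as JSON, read the JSON body and the X-Generation-Id header) might look like the following sketch using only the Python standard library. The request field names `input_audio`, `data`, and `format` are assumptions for illustration, since this page does not spell out the body schema; check the API reference before relying on them. The function names are hypothetical.

```python
import base64
import json
import urllib.request

API_URL = "https://openrouter.ai/api/v1/audio/transcriptions"
MODEL = "microsoft/mai-transcribe-1.5"


def build_request(audio_bytes: bytes, audio_format: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a transcription request."""
    payload = {
        "model": MODEL,
        # ASSUMPTION: these field names are illustrative, not confirmed by this page.
        "input_audio": {
            "data": base64.b64encode(audio_bytes).decode("ascii"),
            "format": audio_format,
        },
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def transcribe(audio_bytes: bytes, audio_format: str, api_key: str):
    """Send the request; return (parsed JSON body, generation ID or None)."""
    request = build_request(audio_bytes, audio_format, api_key)
    with urllib.request.urlopen(request) as response:
        body = json.loads(response.read().decode("utf-8"))
        # Header lookup on the response is case-insensitive.
        generation_id = response.headers.get("X-Generation-Id")
    return body, generation_id
```

The sketch returns the whole parsed body instead of a single field, because the exact response keys are not listed on this page.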

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

Endpoint

Transcribes audio into text. Accepts base64-encoded audio input and returns the transcribed text.

POST https://openrouter.ai/api/v1/audio/transcriptions
Authorization: Bearer $OPENROUTER_API_KEY
Content-Type: application/json
HTTP-Referer: (optional) your site URL, for rankings
X-Title: (optional) your site name, for rankings
Model: microsoft/mai-transcribe-1.5
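As an illustration of the required and optional headers listed above, the following Python sketch assembles them into a dictionary and includes the two optional ranking headers only when values are supplied. The helper name `build_headers` and its parameter names are hypothetical.

```python
from typing import Dict, Optional


def build_headers(
    api_key: str,
    site_url: Optional[str] = None,
    site_name: Optional[str] = None,
) -> Dict[str, str]:
    """Assemble request headers for the transcription endpoint."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    # Optional headers, used for rankings according to this page.
    if site_url:
        headers["HTTP-Referer"] = site_url
    if site_name:
        headers["X-Title"] = site_name
    return headers
```

Leaving the optional headers out entirely, rather than sending empty strings, keeps the request minimal when no site information is available.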

Parameters

Name | Type | Default | Description
max_tokens | integer | (none) | This sets the upper limit for the number of tokens the model can generate in response.
temperature | float | 1 | This setting influences the variety in the model's responses.
top_p | float | 1 | This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P.
max_completion_tokens | integer | (none) | This sets the upper limit for the number of tokens the model can generate in response.
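To illustrate what the top_p description means, here is a generic sketch of top-p (nucleus) filtering over a toy probability table. It is not the model's or provider's actual sampler; the function name and the example tokens are hypothetical.

```python
from typing import Dict


def top_p_filter(probs: Dict[str, float], p: float) -> Dict[str, float]:
    """Keep the most likely tokens until their cumulative probability reaches p,
    then renormalize the kept probabilities so they sum to 1."""
    if not 0 < p <= 1:
        raise ValueError("p must be in (0, 1]")
    kept: Dict[str, float] = {}
    cumulative = 0.0
    # Walk tokens from most to least likely.
    for token, prob in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept[token] = prob
        cumulative += prob
        if cumulative >= p:
            break
    total = sum(kept.values())
    return {token: prob / total for token, prob in kept.items()}
```

With probabilities {"a": 0.5, "b": 0.3, "c": 0.2} and p = 0.7, the filter keeps "a" and "b" and drops "c"; with the default p = 1, every token stays eligible.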