Provider Routing | Intelligent Multi-Provider Request Routing | OpenRouter

OpenRouter routes requests to the best available providers for your model. By default, requests are load balanced across the top providers to maximize uptime.

You can customize how your requests are routed using the provider object in the request body for Chat Completions and Completions.

For a complete list of valid provider names to use in the API, see the full provider schema.

The provider object can contain the following fields:

Field	Type	Default	Description
`order`	string[]	-	List of provider names to try in order (e.g. `["Anthropic", "OpenAI"]`). Learn more
`allow_fallbacks`	boolean	`true`	Whether to allow backup providers when the primary is unavailable. Learn more
`require_parameters`	boolean	`false`	Only use providers that support all parameters in your request. Learn more
`data_collection`	”allow” \| “deny"	"allow”	Control whether to use providers that may store data. Learn more
`ignore`	string[]	-	List of provider names to skip for this request. Learn more
`quantizations`	string[]	-	List of quantization levels to filter by (e.g. `["int4", "int8"]`). Learn more
`sort`	string	-	Sort providers by price or throughput. (e.g. `"price"` or `"throughput"`). Learn more

Price-Based Load Balancing (Default Strategy)

For each model in your request, OpenRouter’s default behavior is to load balance requests across providers, prioritizing price.

If you are more sensitive to throughput than price, you can use the sort field to explicitly prioritize throughput.

When you send a request with tools or tool_choice, OpenRouter will only route to providers that support tool use. Similarly, if you set a max_tokens, then OpenRouter will only route to providers that support a response of that length.

Here is OpenRouter’s default load balancing strategy:

Prioritize providers that have not seen significant outages in the last 30 seconds.
For the stable providers, look at the lowest-cost candidates and select one weighted by inverse square of the price (example below).
Use the remaining providers as fallbacks.

A Load Balancing Example

If Provider A costs $1 per million tokens, Provider B costs $2, and Provider C costs $3, and Provider B recently saw a few outages.

Your request is routed to Provider A. Provider A is 9x more likely to be first routed to Provider A than Provider C because $(1 / 3^2 = 1/9)$ (inverse square of the price).
If Provider A fails, then Provider C will be tried next.
If Provider C also fails, Provider B will be tried last.

If you have sort or order set in your provider preferences, load balancing will be disabled.

Provider Sorting

As described above, OpenRouter load balances based on price, while taking uptime into account.

If you instead want to explicitly prioritize a particular provider attribute, you can include the sort field in the provider preferences. Load balancing will be disabled, and the router will try providers in order.

The three sort options are:

"price": prioritize lowest price
"throughput": prioritize highest throughput
"latency": prioritize lowest latency

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'meta-llama/llama-3.1-70b-instruct',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ],
17     'provider': {
18       'sort': 'throughput'
19     }
20   }),
21 });

To always prioritize low prices, and not apply any load balancing, set sort to "price".

To always prioritize low latency, and not apply any load balancing, set sort to "latency".

Nitro Shortcut

You can append :nitro to any model slug as a shortcut to sort by throughput. This is exactly equivalent to setting provider.sort to "throughput".

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'meta-llama/llama-3.1-70b-instruct:nitro',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ]
17   }),
18 });

Floor Price Shortcut

You can append :floor to any model slug as a shortcut to sort by price. This is exactly equivalent to setting provider.sort to "price".

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'meta-llama/llama-3.1-70b-instruct:floor',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ]
17   }),
18 });

Ordering Specific Providers

You can set the providers that OpenRouter will prioritize for your request using the order field.

Field	Type	Default	Description
`order`	string[]	-	List of provider names to try in order (e.g. `["Anthropic", "OpenAI"]`).

The router will prioritize providers in this list, and in this order, for the model you’re using. If you don’t set this field, the router will load balance across the top providers to maximize uptime.

OpenRouter will try them one at a time and proceed to other providers if none are operational. If you don’t want to allow any other providers, you should disable fallbacks as well.

Example: Specifying providers with fallbacks

This example skips over OpenAI (which doesn’t host Mixtral), tries Together, and then falls back to the normal list of providers on OpenRouter:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'mistralai/mixtral-8x7b-instruct',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ],
17     'provider': {
18       'order': [
19         'OpenAI',
20         'Together'
21       ]
22     }
23   }),
24 });

Example: Specifying providers with fallbacks disabled

Here’s an example with allow_fallbacks set to false that skips over OpenAI (which doesn’t host Mixtral), tries Together, and then fails if Together fails:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'mistralai/mixtral-8x7b-instruct',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ],
17     'provider': {
18       'order': [
19         'OpenAI',
20         'Together'
21       ],
22       'allow_fallbacks': false
23     }
24   }),
25 });

Requiring Providers to Support All Parameters

You can restrict requests only to providers that support all parameters in your request using the require_parameters field.

Field	Type	Default	Description
`require_parameters`	boolean	`false`	Only use providers that support all parameters in your request.

With the default routing strategy, providers that don’t support all the LLM parameters specified in your request can still receive the request, but will ignore unknown parameters. When you set require_parameters to true, the request won’t even be routed to that provider.

Example: Excluding providers that don’t support JSON formatting

For example, to only use providers that support JSON formatting:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'messages': [
11       {
12         'role': 'user',
13         'content': 'Hello'
14       }
15     ],
16     'provider': {
17       'require_parameters': true
18     },
19     'response_format': {
20       'type': 'json_object'
21     }
22   }),
23 });

Requiring Providers to Comply with Data Policies

You can restrict requests only to providers that comply with your data policies using the data_collection field.

Field	Type	Default	Description
`data_collection`	”allow” \| “deny"	"allow”	Control whether to use providers that may store data.

allow: (default) allow providers which store user data non-transiently and may train on it
deny: use only providers which do not collect user data

Some model providers may log prompts, so we display them with a Data Policy tag on model pages. This is not a definitive source of third party data policies, but represents our best knowledge.

Account-Wide Data Policy Filtering

This is also available as an account-wide setting in your privacy settings. You can disable third party model providers that store inputs for training.

Example: Excluding providers that don’t comply with data policies

To exclude providers that don’t comply with your data policies, set data_collection to deny:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'messages': [
11       {
12         'role': 'user',
13         'content': 'Hello'
14       }
15     ],
16     'provider': {
17       'data_collection': 'deny'
18     }
19   }),
20 });

Disabling Fallbacks

To guarantee that your request is only served by the top (lowest-cost) provider, you can disable fallbacks.

This is combined with the order field from Ordering Specific Providers to restrict the providers that OpenRouter will prioritize to just your chosen list.

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'messages': [
11       {
12         'role': 'user',
13         'content': 'Hello'
14       }
15     ],
16     'provider': {
17       'allow_fallbacks': false
18     }
19   }),
20 });

Ignoring Providers

You can ignore providers for a request by setting the ignore field in the provider object.

Field	Type	Default	Description
`ignore`	string[]	-	List of provider names to skip for this request.

Ignoring multiple providers may significantly reduce fallback options and limit request recovery.

Account-Wide Ignored Providers

You can ignore providers for all account requests by configuring your preferences. This configuration applies to all API requests and chatroom messages.

Note that when you ignore providers for a specific request, the list of ignored providers is merged with your account-wide ignored providers.

Example: Ignoring Azure for a request calling GPT-4 Omni

Here’s an example that will ignore Azure for a request calling GPT-4 Omni:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'openai/gpt-4o',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ],
17     'provider': {
18       'ignore': [
19         'Azure'
20       ]
21     }
22   }),
23 });

Quantization

Quantization reduces model size and computational requirements while aiming to preserve performance. Most LLMs today use FP16 or BF16 for training and inference, cutting memory requirements in half compared to FP32. Some optimizations use FP8 or quantization to reduce size further (e.g., INT8, INT4).

Field	Type	Default	Description
`quantizations`	string[]	-	List of quantization levels to filter by (e.g. `["int4", "int8"]`). Learn more

Quantized models may exhibit degraded performance for certain prompts, depending on the method used.

Providers can support various quantization levels for open-weight models.

Quantization Levels

By default, requests are load-balanced across all available providers, ordered by price. To filter providers by quantization level, specify the quantizations field in the provider parameter with the following values:

int4: Integer (4 bit)
int8: Integer (8 bit)
fp4: Floating point (4 bit)
fp6: Floating point (6 bit)
fp8: Floating point (8 bit)
fp16: Floating point (16 bit)
bf16: Brain floating point (16 bit)
fp32: Floating point (32 bit)
unknown: Unknown

Example: Requesting FP8 Quantization

Here’s an example that will only use providers that support FP8 quantization:

1 fetch('https://openrouter.ai/api/v1/chat/completions', {
2   method: 'POST',
3   headers: {
4     'Authorization': 'Bearer <OPENROUTER_API_KEY>',
5     'HTTP-Referer': '<YOUR_SITE_URL>', // Optional. Site URL for rankings on openrouter.ai.
6     'X-Title': '<YOUR_SITE_NAME>', // Optional. Site title for rankings on openrouter.ai.
7     'Content-Type': 'application/json',
8   },
9   body: JSON.stringify({
10     'model': 'meta-llama/llama-3.1-8b-instruct',
11     'messages': [
12       {
13         'role': 'user',
14         'content': 'Hello'
15       }
16     ],
17     'provider': {
18       'quantizations': [
19         'fp8'
20       ]
21     }
22   }),
23 });

Terms of Service

You can view the terms of service for each provider below. You may not violate the terms of service or policies of third-party providers that power the models on OpenRouter.

OpenAI: https://openai.com/policies/row-terms-of-use/
Anthropic: https://www.anthropic.com/legal/commercial-terms
Google Vertex: https://cloud.google.com/terms/
Google AI Studio: https://cloud.google.com/terms/
Amazon Bedrock: https://aws.amazon.com/service-terms/
Groq: https://groq.com/terms-of-use/
SambaNova: https://sambanova.ai/terms-and-conditions
Cohere: https://cohere.com/terms-of-use
Mistral: https://mistral.ai/terms/#terms-of-use
Together: https://www.together.ai/terms-of-service
Together (lite): https://www.together.ai/terms-of-service
Fireworks: https://fireworks.ai/terms-of-service
DeepInfra: https://deepinfra.com/terms
Lepton: https://www.lepton.ai/policies/tos
NovitaAI: https://novita.ai/legal/terms-of-service
Avian.io: https://avian.io/terms
Lambda: https://lambda.ai/legal/terms-of-service
Azure: https://www.microsoft.com/en-us/legal/terms-of-use?oneroute=true
Perplexity: https://www.perplexity.ai/hub/legal/perplexity-api-terms-of-service
DeepSeek: https://chat.deepseek.com/downloads/DeepSeek%20Terms%20of%20Use.html
Infermatic: https://infermatic.ai/terms-and-conditions/
AI21: https://www.ai21.com/terms-of-service/
Featherless: https://featherless.ai/terms
Inflection: https://developers.inflection.ai/tos
xAI: https://x.ai/legal/terms-of-service
Cloudflare: https://www.cloudflare.com/service-specific-terms-developer-platform/#developer-platform-terms
InoCloud: https://inocloud.com/terms
Minimax: https://www.minimax.io/platform/protocol/terms-of-service
NextBit: https://www.nextbit256.com/docs/terms-of-service
Nineteen: https://nineteen.ai/tos
Liquid: https://www.liquid.ai/terms-conditions
Inception: https://www.inceptionlabs.ai/terms
GMICloud: https://docs.gmicloud.ai/privacy
nCompass: https://ncompass.tech/terms
inference.net: https://inference.net/terms-of-service
Friendli: https://friendli.ai/terms-of-service
AionLabs: https://www.aionlabs.ai/terms/
Alibaba: https://www.alibabacloud.com/help/en/legal/latest/alibaba-cloud-international-website-product-terms-of-service-v-3-8-0
Nebius AI Studio: https://docs.nebius.com/legal/studio/terms-of-use/
Chutes: https://chutes.ai/tos
kluster.ai: https://www.kluster.ai/terms-of-use
Crusoe: https://legal.crusoe.ai/open-router#managed-inference-tos-open-router
Targon: https://targon.com/terms
Ubicloud: https://www.ubicloud.com/docs/about/terms-of-service
Parasail: https://www.parasail.io/legal/terms
Phala: https://red-pill.ai/terms
CentML: https://centml.ai/terms-of-service/
Venice: https://venice.ai/legal/tos
OpenInference: https://www.openinference.xyz/terms
Atoma: https://atoma.network/terms_of_service
Enfer: https://enfer.ai/privacy-policy
Mancer: https://mancer.tech/terms
Mancer (private): https://mancer.tech/terms
Hyperbolic: https://hyperbolic.xyz/terms
Hyperbolic (quantized): https://hyperbolic.xyz/terms

JSON Schema for Provider Preferences

For a complete list of options, see this JSON schema:

Provider Preferences Schema

1 {
2     "$ref": "#/definitions/Provider Preferences Schema",
3     "definitions": {
4       "Provider Preferences Schema": {
5         "type": "object",
6         "properties": {
7           "allow_fallbacks": {
8             "type": [
9               "boolean",
10               "null"
11             ],
12             "description": "Whether to allow backup providers to serve requests\n- true: (default) when the primary provider (or your custom providers in \"order\") is unavailable, use the next best provider.\n- false: use only the primary/custom provider, and return the upstream error if it's unavailable.\n"
13           },
14           "require_parameters": {
15             "type": [
16               "boolean",
17               "null"
18             ],
19             "description": "Whether to filter providers to only those that support the parameters you've provided. If this setting is omitted or set to false, then providers will receive only the parameters they support, and ignore the rest."
20           },
21           "data_collection": {
22             "anyOf": [
23               {
24                 "type": "string",
25                 "enum": [
26                   "deny",
27                   "allow"
28                 ]
29               },
30               {
31                 "type": "null"
32               }
33             ],
34             "description": "Data collection setting. If no available model provider meets the requirement, your request will return an error.\n- allow: (default) allow providers which store user data non-transiently and may train on it\n- deny: use only providers which do not collect user data.\n"
35           },
36           "order": {
37             "anyOf": [
38               {
39                 "type": "array",
40                 "items": {
41                   "type": "string",
42                   "enum": [
43                     "OpenAI",
44                     "Anthropic",
45                     "Google",
46                     "Google AI Studio",
47                     "Amazon Bedrock",
48                     "Groq",
49                     "SambaNova",
50                     "Cohere",
51                     "Mistral",
52                     "Together",
53                     "Together 2",
54                     "Fireworks",
55                     "DeepInfra",
56                     "Lepton",
57                     "Novita",
58                     "Avian",
59                     "Lambda",
60                     "Azure",
61                     "Perplexity",
62                     "DeepSeek",
63                     "Infermatic",
64                     "AI21",
65                     "Featherless",
66                     "Inflection",
67                     "xAI",
68                     "Cloudflare",
69                     "InoCloud",
70                     "Minimax",
71                     "NextBit",
72                     "Nineteen",
73                     "Liquid",
74                     "Inception",
75                     "GMICloud",
76                     "Stealth",
77                     "NCompass",
78                     "InferenceNet",
79                     "Friendli",
80                     "AionLabs",
81                     "Alibaba",
82                     "Nebius",
83                     "Chutes",
84                     "Kluster",
85                     "Crusoe",
86                     "Targon",
87                     "Ubicloud",
88                     "Parasail",
89                     "Phala",
90                     "Cent-ML",
91                     "Venice",
92                     "OpenInference",
93                     "Atoma",
94                     "Enfer",
95                     "Mancer",
96                     "Mancer 2",
97                     "Hyperbolic",
98                     "Hyperbolic 2",
99                     "Reflection"
100                   ]
101                 }
102               },
103               {
104                 "type": "null"
105               }
106             ],
107             "description": "An ordered list of provider names. The router will attempt to use the first provider in the subset of this list that supports your requested model, and fall back to the next if it is unavailable. If no providers are available, the request will fail with an error message."
108           },
109           "ignore": {
110             "anyOf": [
111               {
112                 "type": "array",
113                 "items": {
114                   "type": "string",
115                   "enum": [
116                     "OpenAI",
117                     "Anthropic",
118                     "Google",
119                     "Google AI Studio",
120                     "Amazon Bedrock",
121                     "Groq",
122                     "SambaNova",
123                     "Cohere",
124                     "Mistral",
125                     "Together",
126                     "Together 2",
127                     "Fireworks",
128                     "DeepInfra",
129                     "Lepton",
130                     "Novita",
131                     "Avian",
132                     "Lambda",
133                     "Azure",
134                     "Perplexity",
135                     "DeepSeek",
136                     "Infermatic",
137                     "AI21",
138                     "Featherless",
139                     "Inflection",
140                     "xAI",
141                     "Cloudflare",
142                     "InoCloud",
143                     "Minimax",
144                     "NextBit",
145                     "Nineteen",
146                     "Liquid",
147                     "Inception",
148                     "GMICloud",
149                     "Stealth",
150                     "NCompass",
151                     "InferenceNet",
152                     "Friendli",
153                     "AionLabs",
154                     "Alibaba",
155                     "Nebius",
156                     "Chutes",
157                     "Kluster",
158                     "Crusoe",
159                     "Targon",
160                     "Ubicloud",
161                     "Parasail",
162                     "Phala",
163                     "Cent-ML",
164                     "Venice",
165                     "OpenInference",
166                     "Atoma",
167                     "Enfer",
168                     "Mancer",
169                     "Mancer 2",
170                     "Hyperbolic",
171                     "Hyperbolic 2",
172                     "Reflection"
173                   ]
174                 }
175               },
176               {
177                 "type": "null"
178               }
179             ],
180             "description": "List of provider names to ignore. If provided, this list is merged with your account-wide ignored provider settings for this request."
181           },
182           "quantizations": {
183             "anyOf": [
184               {
185                 "type": "array",
186                 "items": {
187                   "type": "string",
188                   "enum": [
189                     "int4",
190                     "int8",
191                     "fp4",
192                     "fp6",
193                     "fp8",
194                     "fp16",
195                     "bf16",
196                     "fp32",
197                     "unknown"
198                   ]
199                 }
200               },
201               {
202                 "type": "null"
203               }
204             ],
205             "description": "A list of quantization levels to filter the provider by."
206           },
207           "sort": {
208             "anyOf": [
209               {
210                 "type": "string",
211                 "enum": [
212                   "price",
213                   "throughput",
214                   "latency"
215                 ]
216               },
217               {
218                 "type": "null"
219               }
220             ],
221             "description": "The sorting strategy to use for this request, if \"order\" is not specified. When set, no load balancing is performed."
222           }
223         },
224         "additionalProperties": false
225       }
226     },
227     "$schema": "http://json-schema.org/draft-07/schema#"
228   }