PreferredMaxLatency | OpenRouter Python SDK | OpenRouter

The Python SDK and docs are currently in beta. Report issues on GitHub.

Preferred maximum latency (in seconds). Can be a number (applies to p50) or an object with percentile-specific cutoffs. Endpoints above the threshold(s) may still be used, but are deprioritized in routing. When using fallback models, this may cause a fallback model to be used instead of the primary model if it meets the threshold.

Supported Types

`float`

1 value: float = /* values here */

`components.PercentileLatencyCutoffs`

1 value: components.PercentileLatencyCutoffs = /* values here */

`Any`

1 value: Any = /* values here */