PercentileThroughputCutoffs - Python SDK

PercentileThroughputCutoffs method reference

The Python SDK and docs are currently in beta. Report issues on GitHub.

Percentile-based throughput cutoffs. All specified cutoffs must be met for an endpoint to be preferred.

Fields

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| p50 | OptionalNullable[float] | No | Minimum p50 throughput (tokens/sec) |
| p75 | OptionalNullable[float] | No | Minimum p75 throughput (tokens/sec) |
| p90 | OptionalNullable[float] | No | Minimum p90 throughput (tokens/sec) |
| p99 | OptionalNullable[float] | No | Minimum p99 throughput (tokens/sec) |
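
The "all specified cutoffs must be met" rule can be illustrated with a minimal, self-contained sketch. It does not use the SDK: `ThroughputCutoffs`, `meets_cutoffs`, and the `observed` mapping are hypothetical names made up for this example, and the comparison details (a cutoff is met when the observed value is greater than or equal to it, and a missing observation counts as a failure) are assumptions rather than documented behavior.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class ThroughputCutoffs:
    """Hypothetical stand-in mirroring the four fields above; not the SDK class."""

    p50: Optional[float] = None
    p75: Optional[float] = None
    p90: Optional[float] = None
    p99: Optional[float] = None


def meets_cutoffs(cutoffs: ThroughputCutoffs, observed: Mapping[str, float]) -> bool:
    """Return True only if every cutoff that is set is met by the observed value."""
    for name in ("p50", "p75", "p90", "p99"):
        minimum = getattr(cutoffs, name)
        if minimum is None:
            continue  # an unset cutoff imposes no constraint
        value = observed.get(name)
        # Assumption: a missing observation fails, and the minimum is inclusive.
        if value is None or value < minimum:
            return False
    return True


# Only p50 and p90 are specified, so only those two are checked.
cutoffs = ThroughputCutoffs(p50=40.0, p90=20.0)
fast = {"p50": 55.0, "p75": 42.0, "p90": 25.0, "p99": 9.0}
slow = {"p50": 55.0, "p75": 42.0, "p90": 15.0, "p99": 9.0}

print(meets_cutoffs(cutoffs, fast))  # True: both specified cutoffs are met
print(meets_cutoffs(cutoffs, slow))  # False: p90 of 15.0 is below the 20.0 cutoff
```

In this sketch, leaving a field as `None` means that percentile is not checked, which matches the "all specified cutoffs" wording above. How the service treats an explicit null versus an omitted field is not stated here.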