PublicEndpoint - Go SDK

PublicEndpoint type definition

The Go SDK and docs are currently in beta. Report issues on GitHub.

Information about a specific model endpoint

Fields

| Field | Type | Required | Description | Example |
| --- | --- | --- | --- | --- |
| `Name` | `string` | ✔️ | N/A | |
| `ModelID` | `string` | ✔️ | The unique identifier for the model (permaslug) | openai/gpt-4 |
| `ModelName` | `string` | ✔️ | N/A | |
| `ContextLength` | `int64` | ✔️ | N/A | |
| `Pricing` | `components.Pricing` | ✔️ | N/A | |
| `ProviderName` | `components.ProviderName` | ✔️ | N/A | OpenAI |
| `Tag` | `string` | ✔️ | N/A | |
| `Quantization` | `*components.PublicEndpointQuantization` | ✔️ | N/A | fp16 |
| `MaxCompletionTokens` | `int64` | ✔️ | N/A | |
| `MaxPromptTokens` | `int64` | ✔️ | N/A | |
| `SupportedParameters` | `[]components.Parameter` | ✔️ | N/A | |
| `Status` | `*components.EndpointStatus` | | N/A | 0 |
| `UptimeLast30m` | `float64` | ✔️ | N/A | |
| `UptimeLast5m` | `float64` | ✔️ | Uptime percentage over the last 5 minutes, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
| `UptimeLast1d` | `float64` | ✔️ | Uptime percentage over the last 1 day, calculated as successful requests / (successful + error requests) * 100. Rate-limited requests are excluded. Returns null if insufficient data. | |
| `SupportsImplicitCaching` | `bool` | ✔️ | N/A | |
| `LatencyLast30m` | `*components.PercentileStats` | ✔️ | Latency percentiles in milliseconds over the last 30 minutes. Latency measures time to first token. Only visible when authenticated with an API key or cookie; returns null for unauthenticated requests. | `{"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3}` |
| `ThroughputLast30m` | `*components.PercentileStats` | ✔️ | N/A | `{"p50": 25.5,"p75": 35.2,"p90": 48.7,"p99": 85.3}` |
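
The uptime formula described for `UptimeLast5m` and `UptimeLast1d` can be illustrated with a small standalone sketch. It does not use the SDK: `uptimePercent` and `describeUptime` are hypothetical helpers, and treating "insufficient data" as "zero counted requests" is an assumption, since the actual threshold is not documented here. The null result is modeled as a nil `*float64`.

```go
package main

import "fmt"

// uptimePercent is a hypothetical helper, not part of the SDK. It applies the
// documented formula: successful / (successful + errored) * 100.
// Rate-limited requests are never passed in, so they are excluded.
// Assumption: "insufficient data" means no counted requests at all; in that
// case nil is returned to stand in for the documented null.
func uptimePercent(successful, errored int64) *float64 {
	total := successful + errored
	if total == 0 {
		return nil
	}
	pct := float64(successful) / float64(total) * 100
	return &pct
}

// describeUptime formats an uptime value and handles the nil (null) case.
func describeUptime(u *float64) string {
	if u == nil {
		return "insufficient data"
	}
	return fmt.Sprintf("%.1f%%", *u)
}

func main() {
	// 97 successful and 3 failed requests; rate-limited ones are not counted.
	fmt.Println(describeUptime(uptimePercent(97, 3)))
	// No counted requests in the window.
	fmt.Println(describeUptime(uptimePercent(0, 0)))
}
```

The same nil check applies when reading pointer-typed fields such as `LatencyLast30m`, which is documented to be null for unauthenticated requests.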