List all models and their properties
Authorizations
API key as bearer token in Authorization header
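A minimal sketch of attaching the key, using only Python's standard library. The URL `https://api.example.com/api/v1/models` and the helper name `build_request` are placeholders that do not come from this reference; only the `Authorization: Bearer <key>` shape follows from the description above.

```python
import urllib.request

# Placeholder URL (assumption): substitute the real endpoint you are calling.
MODELS_URL = "https://api.example.com/api/v1/models"


def build_request(api_key: str, url: str = MODELS_URL) -> urllib.request.Request:
    """Build (but do not send) a GET request carrying the key as a bearer token."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


if __name__ == "__main__":
    request = build_request("sk-example-key")
    # Inspect the header that would be sent; no network call is made here.
    print(request.get_header("Authorization"))
```

Passing the resulting object to `urllib.request.urlopen` would perform the call; the sketch stops short of that so it can be run without credentials.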
Query Parameters
Filter models by use case category
Available options: programming, roleplay, marketing, marketing/seo, technology, science, translation, legal, finance, health, trivia, academia
"programming"
Filter models by supported parameter (comma-separated)
"temperature"
Filter models by output modality. Accepts a comma-separated list of modalities (text, image, audio, embeddings) or "all" to include all models. Defaults to "text".
"text"
Sort the returned models server-side. Prefer this over fetching the full list and sorting client-side. Options: pricing-low-to-high, pricing-high-to-low (average prompt/completion price), context-high-to-low (context length), throughput-high-to-low, latency-low-to-high (recent median performance), most-popular, top-weekly (tokens processed in the last week), newest (creation date), intelligence-high-to-low (Artificial Analysis intelligence index), design-arena-elo-high-to-low (best Design Arena ELO across arenas). Models without a score for the chosen benchmark are placed last. When omitted, the existing default ordering is preserved.
Available options: most-popular, newest, top-weekly, pricing-low-to-high, pricing-high-to-low, context-high-to-low, throughput-high-to-low, latency-low-to-high, intelligence-high-to-low, design-arena-elo-high-to-low
"newest"
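The "placed last" rule for models without a benchmark score can be pictured with a small sketch. It only illustrates the ordering described above; it is not the server's implementation and not a substitute for the server-side sort. The `score` key and the sample records are hypothetical.

```python
from typing import Optional


def sort_high_to_low(models: list[dict], key: str = "score") -> list[dict]:
    """Order records by `key` descending, keeping unscored records at the end."""

    def sort_key(model: dict) -> tuple[bool, float]:
        value: Optional[float] = model.get(key)
        # False sorts before True, so scored models come first;
        # negating the score yields descending order among them.
        return (value is None, -value if value is not None else 0.0)

    return sorted(models, key=sort_key)


if __name__ == "__main__":
    sample = [
        {"id": "a", "score": 10},
        {"id": "b", "score": None},  # no score recorded
        {"id": "c", "score": 42},
        {"id": "d"},                 # score key absent
    ]
    print([m["id"] for m in sort_high_to_low(sample)])
```

Because Python's sort is stable, unscored records keep their original relative order at the tail of the list.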
Return results as an RSS feed
"true"
Use chat links in RSS feed items
"true"
Free-text search by model name or slug.
"gpt-4"
Filter models by input modality. Comma-separated list of: text, image, audio, file.
"text,image"
Minimum context length (tokens). Models with smaller context are excluded.
Required range: x > 0
128000
Minimum prompt price in $/M tokens.
Required range: x >= 0
0
Maximum prompt price in $/M tokens.
Required range: x >= 0
10
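The three numeric filters above can be read as a simple predicate over a model record. The sketch below assumes that `pricing.prompt` in the response is a USD-per-token decimal string (the units are not stated in this reference) and that the price bounds are inclusive; the function names are hypothetical.

```python
from decimal import Decimal
from typing import Optional

TOKENS_PER_MILLION = Decimal(1_000_000)


def prompt_price_per_million(model: dict) -> Decimal:
    """Convert the prompt price to $/M tokens.

    Assumption: pricing["prompt"] is a USD-per-token decimal string.
    """
    return Decimal(model["pricing"]["prompt"]) * TOKENS_PER_MILLION


def matches_filters(
    model: dict,
    min_context: Optional[int] = None,
    min_prompt_price: Optional[Decimal] = None,
    max_prompt_price: Optional[Decimal] = None,
) -> bool:
    """Return True if the record passes every filter that was supplied."""
    # Models with a smaller context than the minimum are excluded.
    if min_context is not None and model["context_length"] < min_context:
        return False
    price = prompt_price_per_million(model)
    # Inclusive bounds are an assumption of this sketch.
    if min_prompt_price is not None and price < min_prompt_price:
        return False
    if max_prompt_price is not None and price > max_prompt_price:
        return False
    return True


if __name__ == "__main__":
    record = {"context_length": 8192, "pricing": {"prompt": "0.00003"}}
    print(prompt_price_per_million(record))
    print(matches_filters(record, min_context=128000))
```

`Decimal` is used instead of `float` so that the price strings are compared without binary rounding error.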
Filter models by architecture/model family (e.g. GPT, Claude, Gemini, Llama).
"GPT"
Filter models by the organization that created the model. Comma-separated list of author slugs.
"openai,anthropic"
Filter models by hosting provider. Comma-separated list of provider names.
"OpenAI,Anthropic"
Filter by distillation capability. "true" returns only distillable models, "false" excludes them.
Available options: true, false
"true"
When set to "true", return only models that have zero-data-retention endpoints.
Available options: true
"true"
Filter to models with endpoints in the given data region. Currently only "eu" is supported.
Available options: eu
"eu"
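Several of the filters above take comma-separated lists or the literal strings "true" and "false". The sketch below shows one way to serialize such values into a query string. The parameter names in the usage example (`input_modalities`, `distillable`, `min_context`) are placeholders, since this reference does not show the actual names; check them against the real API before use.

```python
from urllib.parse import urlencode


def encode_value(value) -> str:
    """Serialize one filter value the way the descriptions above suggest."""
    # bool must be checked before anything else: str(True) would give "True".
    if isinstance(value, bool):
        return "true" if value else "false"
    # Lists and tuples become comma-separated strings, e.g. "text,image".
    if isinstance(value, (list, tuple)):
        return ",".join(str(item) for item in value)
    return str(value)


def build_query(params: dict) -> str:
    """Build a query string, skipping filters that are set to None."""
    pairs = [
        (name, encode_value(value))
        for name, value in params.items()
        if value is not None
    ]
    # safe="," keeps the list separators readable instead of emitting %2C.
    return urlencode(pairs, safe=",")


if __name__ == "__main__":
    # Hypothetical parameter names, for illustration only.
    print(build_query({
        "input_modalities": ["text", "image"],
        "distillable": True,
        "min_context": 128000,
    }))
```

Dropping `None` values means an omitted filter is simply left out of the request rather than sent as an empty string.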
Response
Returns a list of models or an RSS feed
List of available models
[
  {
    "architecture": {
      "input_modalities": ["text"],
      "instruct_type": "chatml",
      "modality": "text->text",
      "output_modalities": ["text"],
      "tokenizer": "GPT"
    },
    "canonical_slug": "openai/gpt-4",
    "context_length": 8192,
    "created": 1692901234,
    "default_parameters": null,
    "description": "GPT-4 is a large multimodal model that can solve difficult problems with greater accuracy.",
    "expiration_date": null,
    "id": "openai/gpt-4",
    "knowledge_cutoff": null,
    "links": {
      "details": "/api/v1/models/openai/gpt-4/endpoints"
    },
    "name": "GPT-4",
    "per_request_limits": null,
    "pricing": {
      "completion": "0.00006",
      "image": "0",
      "prompt": "0.00003",
      "request": "0"
    },
    "supported_parameters": ["temperature", "top_p", "max_tokens"],
    "supported_voices": null,
    "top_provider": {
      "context_length": 8192,
      "is_moderated": true,
      "max_completion_tokens": 4096
    }
  }
]
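A short sketch of reading one record from a response shaped like the sample above. It assumes `created` is a Unix timestamp in seconds (UTC), which the sample value suggests but this reference does not state; the `summarize` helper and the trimmed `SAMPLE` payload are illustrative only.

```python
import json
from datetime import datetime, timezone

# Trimmed copy of the sample response, kept to the fields used below.
SAMPLE = """
[
  {
    "architecture": {"modality": "text->text"},
    "context_length": 8192,
    "created": 1692901234,
    "id": "openai/gpt-4",
    "knowledge_cutoff": null,
    "supported_parameters": ["temperature", "top_p", "max_tokens"]
  }
]
"""


def summarize(model: dict) -> dict:
    """Pick out a few fields and normalize the nullable ones."""
    # Assumption: "created" is Unix time in seconds.
    created = datetime.fromtimestamp(model["created"], tz=timezone.utc)
    return {
        "id": model["id"],
        "modality": model["architecture"]["modality"],
        "created": created.date().isoformat(),
        "supports_temperature": "temperature" in model["supported_parameters"],
        # Several fields are null in the sample; substitute a readable default.
        "knowledge_cutoff": model.get("knowledge_cutoff") or "unknown",
    }


if __name__ == "__main__":
    for record in json.loads(SAMPLE):
        print(summarize(record))
```

Fields such as `default_parameters`, `expiration_date`, and `supported_voices` are also `null` in the sample, so client code should treat them as optional in the same way.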