List Benchmarks

Unified benchmark endpoint that aggregates scores from multiple benchmark sources (Artificial Analysis, Design Arena). Filter by source to reproduce the exact shapes from the legacy per-source endpoints, or use task_type to find models suited for specific workloads. Authenticate with any valid OpenRouter API key. Rate-limited to 30 requests/minute per key and 500 requests/day per account.

Authentication

AuthorizationBearer
API key as bearer token in Authorization header

Query parameters

sourceenumRequired
Benchmark source to query. Determines the shape of the returned items.
Allowed values:
task_typeenumOptional
Filter results by task type. For Artificial Analysis, maps to the corresponding index. For Design Arena, maps to the matching category.
Allowed values:
arenaenumOptional

Design Arena only: arena to query. Defaults to models when source is design-arena.

Allowed values:
categorystringOptional

Design Arena only: category within the arena (e.g. codecategories, uicomponent, gamedev, 3d, dataviz, image, video, svg). When omitted, returns all categories.

max_resultsintegerOptional1-100Defaults to 50

Max results to return (1–100, default 50).

Response

Benchmark results filtered by the specified source and optional task type.
datalist of objects
metaobject

Errors

400
Bad Request Error
401
Unauthorized Error
429
Too Many Requests Error
500
Internal Server Error