Cerebras

Browse models provided by Cerebras (Terms of Service)

3 models

Tokens processed on OpenRouter

Google: Gemma 4 31BGemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function calling, and multilingual support across 140+ languages. Strong on coding, reasoning, and document understanding tasks. Apache 2.0 license.
by googleApr 2, 2026262K context$0.99/M input tokens$1.49/M output tokens

Cerebras

Browse models provided by Cerebras (Terms of Service)

3 models

Tokens processed on OpenRouter

Google: Gemma 4 31BGemma 4 31B
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function calling, and multilingual support across 140+ languages. Strong on coding, reasoning, and document understanding tasks. Apache 2.0 license.
by googleApr 2, 2026262K context$0.99/M input tokens$1.49/M output tokens

Z.ai: GLM 4.7GLM 4.7

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while delivering more natural conversational experiences and superior front-end aesthetics.

by z-aiDec 22, 2025200K context$2.25/M input tokens$2.75/M output tokens

OpenAI: gpt-oss-120bgpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

by openaiAug 5, 2025131K context$0.35/M input tokens$0.75/M output tokens