How much does GLM 4.6 cost?

GLM 4.6 costs $0.43/M input tokens and $1.75/M output tokens, with separate rates for Cache Read at $0.08/M tokens.

What is the context length of GLM 4.6?

GLM 4.6 has a 204,800 token context window. It supports up to 16,384 completion tokens.

Does GLM 4.6 support tool calling and structured outputs?

Yes. GLM 4.6 accepts tools and tool_choice for function calling. It also supports structured outputs via a JSON schema in response_format.

Which providers serve GLM 4.6?

GLM 4.6 is served by 5 providers on OpenRouter: Venice, DeepInfra, NovitaAI, Z.ai and AtlasCloud. Requests are routed to the best available provider, with automatic failover to the others, and you can pin or exclude providers with provider routing.

When was GLM 4.6 released?

GLM 4.6 was released on 2025-09-30. Its knowledge cutoff is 2025-03-31.

Z.ai: GLM 4.6 (exacto)

Name: Z.ai: GLM 4.6 (exacto)
Author: z-ai

z-ai/glm-4.6:exacto

Model weights

Compare

Compared with GLM-4.5, this generation brings several key improvements:

Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Modalities

In / Out Price

$0.43 / $1.75per 1M

Context

205K

Released

Sep 30, 2025

Knowledge Cutoff

Mar 2025

Z.ai: GLM 4.6 (exacto)

z-ai/glm-4.6:exacto

Z.ai: GLM 4.6 (exacto)

z-ai/glm-4.6:exacto

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Z.ai: GLM 4.6 (exacto)

z-ai/glm-4.6:exacto

Z.ai: GLM 4.6 (exacto)

z-ai/glm-4.6:exacto

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is GLM 4.6?

How much does GLM 4.6 cost?

What is the context length of GLM 4.6?

Does GLM 4.6 support tool calling and structured outputs?

Which providers serve GLM 4.6?

When was GLM 4.6 released?

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is GLM 4.6?

How much does GLM 4.6 cost?

What is the context length of GLM 4.6?

Does GLM 4.6 support tool calling and structured outputs?

Which providers serve GLM 4.6?

When was GLM 4.6 released?