What is the context length of Qwen3 VL 8B Instruct?

Qwen3 VL 8B Instruct has a 262,144 token context window. It supports up to 32,768 completion tokens.

How much does Qwen3 VL 8B Instruct cost?

Qwen3 VL 8B Instruct costs $0.117 per million input tokens and $0.455 per million output tokens.

What providers serve Qwen3 VL 8B Instruct, and can I use it via API?

Qwen3 VL 8B Instruct is available through the OpenRouter API from 2 providers.

What modalities does Qwen3 VL 8B Instruct support?

Qwen3 VL 8B Instruct accepts image, text input and produces text output.

When was Qwen3 VL 8B Instruct released?

Qwen3 VL 8B Instruct was released on 2025-10-14.

Qwen: Qwen3 VL 8B Instruct

Name: Qwen: Qwen3 VL 8B Instruct
Author: qwen

qwen/qwen3-vl-8b-instruct

Model weights

Compare

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon temporal reasoning, DeepStack for fine-grained visual-text alignment, and text-timestamp alignment for precise event localization.

The model supports a native 256K-token context window, extensible to 1M tokens, and handles both static and dynamic media inputs for tasks like document parsing, visual question answering, spatial reasoning, and GUI control. It achieves text understanding comparable to leading LLMs while expanding OCR coverage to 32 languages and enhancing robustness under varied visual conditions.

Modalities

In / Out Price

$0.117 / $0.455per 1M

Context

262K

Released

Oct 14, 2025

Providers

Different companies host the same model. OpenRouter routes your request to one of them based on the routing mode you pick — Balanced (price + speed), Nitro (fastest), or Exacto (highest tool-calling accuracy).

Effective Pricing

The chart below shows the average price customers are actually paying after prompt caching. Depending on the amount of repeated context you send, this can be 60–80% cheaper than the provider list price. Shown are rolling averages from the past 30 days.

Performance

Throughput is how fast the model writes (tokens per second — higher is better). Latency is total round-trip time (lower is better). TTFT is time-to-first-token — how long before you see anything appear (lower is better).

Uptime

Percent of requests that succeeded over the last 30 days. OpenRouter monitors every provider continuously and automatically retries on the next-best provider when one returns an error.

Benchmarks

Scores on standardized evaluations. Higher percentages are better — and rank percentile shows where this model lands among all models on OpenRouter.

Apps

Public apps that send the most traffic to this model. Good signal for what real production workloads look like — and a hint at which use cases this model is best suited for.

Activity

Token volume and request traffic to this model over time.

Quick Start

Drop-in code to call this model. OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. The only thing that changes between models is the model slug below.

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct

Qwen: Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Qwen3 VL 8B Instruct?

What is the context length of Qwen3 VL 8B Instruct?

How much does Qwen3 VL 8B Instruct cost?

What providers serve Qwen3 VL 8B Instruct, and can I use it via API?

What modalities does Qwen3 VL 8B Instruct support?

When was Qwen3 VL 8B Instruct released?

Providers

Effective Pricing

Performance

Uptime

Benchmarks

Apps

Activity

Quick Start

Frequently asked questions

What is Qwen3 VL 8B Instruct?

What is the context length of Qwen3 VL 8B Instruct?

How much does Qwen3 VL 8B Instruct cost?

What providers serve Qwen3 VL 8B Instruct, and can I use it via API?

What modalities does Qwen3 VL 8B Instruct support?

When was Qwen3 VL 8B Instruct released?