Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
Recent activity on Qwen VL Plus
Total usage per day on OpenRouter
Prompt
26K
Completion
15K
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.