Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
Recent activity on Qwen VL Plus
Total usage per day on OpenRouter
Prompt
3.44M
Completion
455K
Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.