What is Qwen2.5-VL 7B Instruct?

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: - SoTA understanding of images of various resolution & ratio: Qwen2.5-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. - Understanding videos of 20min+: Qwen2.5-VL can understand videos over 20 minutes for high-quality video-...

What is the context length of Qwen2.5-VL 7B Instruct?

Qwen2.5-VL 7B Instruct has a 32,768 token context window.

What modalities does Qwen2.5-VL 7B Instruct support?

Qwen2.5-VL 7B Instruct accepts text, image input and produces text output.

When was Qwen2.5-VL 7B Instruct released?

Qwen2.5-VL 7B Instruct was released on 2024-08-28. Its knowledge cutoff is 2024-06-30.

Qwen: Qwen2.5-VL 7B Instruct

Name: Qwen: Qwen2.5-VL 7B Instruct
Author: qwen

qwen/qwen-2.5-vl-7b-instruct

Model weights

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements:

SoTA understanding of images of various resolution & ratio: Qwen2.5-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc.
Understanding videos of 20min+: Qwen2.5-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc.
Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2.5-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions.
Multilingual Support: to serve global users, besides English and Chinese, Qwen2.5-VL now supports the understanding of texts in different languages inside images, including most European languages, Japanese, Korean, Arabic, Vietnamese, etc.

For more details, see this blog post(opens in new tab) and GitHub repo(opens in new tab).

Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT(opens in new tab).

Modalities

Context

33K

Released

Aug 28, 2024

Knowledge Cutoff

Jun 2024