LLaVA 13B

liuhaotian/llava-13b

Updated Nov 162,048 context

LLaVA is a large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities and setting a new state-of-the-art accuracy on Science QA.

#multimodal

LLaVA 13B

liuhaotian/llava-13b

LLaVA 13B

liuhaotian/llava-13b

Tokens processed per day by LLaVA 13B

Tokens processed per day by LLaVA 13B