FireLLaVA 13B

fireworks/firellava-13b

Updated Apr 264,096 context
$0.2/M input tkns$0.2/M output tkns

A blazing fast vision-language model, FireLLaVA quickly understands both text and images. It achieves impressive chat skills in tests, and was designed to mimic multimodal GPT-4.

The first commercially permissive open source LLaVA model, trained entirely on open source LLM generated instruction following data.

OpenRouter attempts providers in this order unless you set dynamic routing preferences. Prices displayed per million tokens.