FireLLaVA 13B
fireworks/firellava-13b
Updated Apr 264,096 context
A blazing fast vision-language model, FireLLaVA quickly understands both text and images. It achieves impressive chat skills in tests, and was designed to mimic multimodal GPT-4.
The first commercially permissive open source LLaVA model, trained entirely on open source LLM generated instruction following data.