A blazing fast vision-language model, FireLLaVA quickly understands both text and images. It achieves impressive chat skills in tests, and was designed to mimic multimodal GPT-4.
The first commercially permissive open source LLaVA model, trained entirely on open source LLM generated instruction following data.
Modalities
Context
4K
Released
Apr 26, 2024
Knowledge Cutoff
Jun 2023
Token volume and request traffic to this model over time.