Google: Gemini 1.5 Flash-8B
google/gemini-flash-1.5-8b
Created Oct 3
1,000,000 context
$0.0375/M input tokens
$0.15/M output tokens
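To make the listed per-million-token prices concrete, here is a minimal sketch of a request cost estimate. The function name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes billing is strictly linear in token count at the two rates listed above, with no other fees or tiers.

```python
# Prices taken from the listing above, in USD per one million tokens.
INPUT_PRICE_PER_M = 0.0375
OUTPUT_PRICE_PER_M = 0.15


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token count at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 5,000-token reply.
    print(f"${estimate_cost(200_000, 5_000):.5f}")  # prints $0.00825
```

Under the same assumption, a prompt that fills the entire 1,000,000-token context would cost $0.0375 in input tokens.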
Gemini 1.5 Flash-8B is optimized for speed and efficiency, with improved performance on small-prompt tasks such as chat, transcription, and translation. Its reduced latency makes it well suited to real-time and large-scale workloads. The model prioritizes cost-effectiveness while maintaining high-quality results.
Usage of Gemini is subject to Google's Gemini Terms of Use.