Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective for real-time and large-scale operations. This model focuses on cost-effective solutions while maintaining high-quality results.
Click here to learn more about this model.
Usage of Gemini is subject to Google's Gemini Terms of Use.
Liquid's 40.3B Mixture of Experts (MoE) model. Liquid Foundation Models (LFMs) are large neural networks built with computational units rooted in dynamic systems.
LFMs are general-purpose AI models that can be used to model any kind of sequential data, including video, audio, text, time series, and signals.
See the launch announcement for benchmarks and more info.
These are free, rate-limited endpoints for LFM 40B MoE. Outputs may be cached. Read about rate limits here.