Model Comparison

Context Length33K

Note: This model is being deprecated. Recommended replacement is the newer Ministral 8B

This model is currently powered by Mistral-7B-v0.2, and incorporates a "better" fine-tuning than Mistral 7B, inspired by community work. It's best used for large batch processing tasks where cost is a significant factor but reasoning capabilities are not crucial.

Provider

Pricing

Input$0.25 / M tokens
Output$0.25 / M tokens
Images– –

Endpoint Features

Quantizationunknown
Max Tokens (input + output)33K
Max Output Tokens– –
Stream cancellation– –
Supports Tools– –
No Prompt Training
Reasoning– –