Mixtral 8x7B (base)

mistralai/mixtral-8x7b

Updated Dec 10
32,768 context
$0.54/M input tokens
$0.54/M output tokens
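
As a rough illustration of how the per-million-token pricing translates into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the constant names are hypothetical and not part of any API; the only values taken from the listing are the two $0.54 rates.

```python
# Rates from the listing, in USD per one million tokens.
PRICE_PER_M_INPUT = 0.54
PRICE_PER_M_OUTPUT = 0.54


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * PRICE_PER_M_INPUT / 1_000_000
    output_cost = output_tokens * PRICE_PER_M_OUTPUT / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token completion.
    print(f"${estimate_cost(2_000, 500):.6f}")
```

Because the input and output rates are the same here, the cost depends only on the total token count; the two constants are kept separate so the sketch still works if the rates differ.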

A pretrained generative Sparse Mixture of Experts model by Mistral AI. It incorporates 8 experts (feed-forward networks) for a total of 47B parameters. This is the base model (not fine-tuned for instructions); see Mixtral 8x7B Instruct for an instruct-tuned model.
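
To make "sparse mixture of experts" more concrete, the toy sketch below shows the general idea: a router scores the 8 experts for each token, only the highest-scoring few are run, and their outputs are combined using softmax weights. This is an illustration in plain Python, not Mistral AI's implementation. The choice of `k=2` active experts is an assumption not stated in the listing, and the experts here are stand-in scaling functions rather than real feed-forward networks.

```python
import math


def top_k_route(router_logits, k=2):
    """Pick the k highest-scoring experts and softmax-normalize their scores.

    k=2 is an assumption for illustration; the listing does not state it.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)[:k]
    peak = max(router_logits[i] for i in ranked)  # subtract for stability
    exps = [math.exp(router_logits[i] - peak) for i in ranked]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(ranked, exps)]


def moe_layer(x, experts, router_logits, k=2):
    """Weighted sum of the selected experts' outputs for one token vector x."""
    out = [0.0] * len(x)
    for idx, weight in top_k_route(router_logits, k):
        y = experts[idx](x)  # only the selected experts are evaluated
        out = [o + weight * yi for o, yi in zip(out, y)]
    return out


# Eight toy "experts": expert n simply scales its input by n (1 through 8).
experts = [lambda x, s=s: [s * v for v in x] for s in range(1, 9)]

if __name__ == "__main__":
    logits = [0.0, 0.0, 0.0, 5.0, 0.0, 0.0, 5.0, 0.0]
    print(top_k_route(logits))
    print(moe_layer([1.0, 2.0], experts, logits))
```

The point of the sketch is that all 8 experts contribute to the total parameter count, while only the routed subset does any work for a given token.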

#moe