- WizardLM-2 8x22B (nitro)
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models. It is an instruct fine-tune of Mixtral 8x22B. To read more about the model release, click here. #moe
Note: this is a higher-throughput version of this model, and it may have higher prices and slightly different outputs.
by microsoft | 66K context | $1/M input tokens | $1/M output tokens | 280M tokens this week
- WizardLM-2 8x22B
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models. It is an instruct fine-tune of Mixtral 8x22B. To read more about the model release, click here. #moe
by microsoft | 66K context | $0.65/M input tokens | $0.65/M output tokens | 1.49B tokens this week
- WizardLM-2 7B
WizardLM-2 7B is the smaller variant of Microsoft AI's latest Wizard model. It is the fastest and achieves performance comparable to leading open-source models 10x its size. It is a fine-tune of Mistral 7B Instruct, using the same technique as WizardLM-2 8x22B. To read more about the model release, click here. #moe
by microsoft | 32K context | $0.07/M input tokens | $0.07/M output tokens | 731M tokens this week
- Cohere: Command R+
Command R+ is a new 104B-parameter LLM from Cohere. It is useful for roleplay, general consumer use cases, and retrieval-augmented generation (RAG). It offers multilingual support for ten key languages to facilitate global business operations. See benchmarks and the launch post here. Use of this model is subject to Cohere's Acceptable Use Policy.
by cohere | 130K context | $3/M input tokens | $15/M output tokens | 227M tokens this week
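
The per-million-token prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming input and output tokens are billed separately at the listed rates with no other fees; the `PRICES` table and the `estimate_cost` helper are illustrative names, not part of any provider's API.

```python
# Prices copied from the listings above, in USD per million tokens:
# (input price, output price). Assumption: billing is linear in token count.
PRICES = {
    "WizardLM-2 8x22B (nitro)": (1.00, 1.00),
    "WizardLM-2 8x22B": (0.65, 0.65),
    "WizardLM-2 7B": (0.07, 0.07),
    "Cohere: Command R+": (3.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Same request (10,000 input tokens, 2,000 output tokens) on every model.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 10_000, 2_000):.4f}")
```

Because Command R+ prices output tokens at five times its input rate, generation-heavy workloads cost disproportionately more on it than on the WizardLM-2 models, which charge the same rate in both directions.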