Nous: Hermes 2 Mixtral 8x7B DPO

nousresearch/nous-hermes-2-mixtral-8x7b-dpo

Updated Jan 1632,768 context
$0.27/M input tkns$0.27/M output tkns

Nous Hermes 2 Mixtral 8x7B DPO is the new flagship Nous Research model trained over the Mixtral 8x7B MoE LLM.

The model was trained on over 1,000,000 entries of primarily GPT-4 generated data, as well as other high quality data from open datasets across the AI landscape, achieving state of the art performance on a variety of tasks.

#moe