- Nous: Hermes 2 Mistral 7B DPO
This is the flagship 7B Hermes model, a Direct Preference Optimization (DPO) of Teknium/OpenHermes-2.5-Mistral-7B. It shows improvement across the board on all benchmarks tested - AGIEval, BigBench Reasoning, GPT4All, and TruthfulQA. The model prior to DPO was trained on 1,000,000 instructions/chats of GPT-4 quality or better, primarily synthetic data as well as other high quality datasets.
by nousresearch8K context$0.18/M input tkns$0.18/M output tkns10.3M tokens this week - Nous: Hermes 2 Vision 7B (alpha)
This vision-language model builds on innovations from the popular OpenHermes-2.5 model, by Teknium. It adds vision support, and is trained on a custom dataset enriched with function calling This project is led by qnguyen3 and teknium. #multimodal
by nousresearch4K context$10/M input tkns$10/M output tkns - MythoMist 7B (free)
From the creator of MythoMax, merges a suite of models to reduce word anticipation, ministrations, and other undesirable words in ChatGPT roleplaying data. It combines Neural Chat 7B, Airoboros 7b, Toppy M 7B, Zepher 7b beta, Nous Capybara 34B, OpenHeremes 2.5, and many others. #merge Note: this is a free, rate-limited version of this model. Outputs may be cached. Read about rate limits here.
by gryphe33K context$0/M input tkns$0/M output tkns580K tokens this week - MythoMist 7B
From the creator of MythoMax, merges a suite of models to reduce word anticipation, ministrations, and other undesirable words in ChatGPT roleplaying data. It combines Neural Chat 7B, Airoboros 7b, Toppy M 7B, Zepher 7b beta, Nous Capybara 34B, OpenHeremes 2.5, and many others. #merge
by gryphe33K context$0.375/M input tkns$0.375/M output tkns76K tokens this week - OpenHermes 2.5 Mistral 7B
A continuation of OpenHermes 2 model, trained on additional code datasets. Potentially the most interesting finding from training on a good ratio (est. of around 7-14% of the total dataset) of code instruction was that it has boosted several non-code benchmarks, including TruthfulQA, AGIEval, and GPT4All suite. It did however reduce BigBench benchmark score, but the net gain overall is significant.
by teknium4K context$0.17/M input tkns$0.17/M output tkns17.6M tokens this week - OpenHermes 2 Mistral 7B
Trained on 900k instructions, surpasses all previous versions of Hermes 13B and below, and matches 70B on some benchmarks. Hermes 2 has strong multiturn chat skills and system prompt capabilities.
by teknium4K context$0.18/M input tkns$0.18/M output tkns3.09M tokens this week - Nous: Hermes 13B
A state-of-the-art language model fine-tuned on over 300k instructions by Nous Research, with Teknium and Emozilla leading the fine tuning process.
by nousresearch4K context$0.18/M input tkns$0.18/M output tkns6.71M tokens this week