OpenChat 3.6 8B
openchat/openchat-8b
Updated Jun 18,192 context
OpenChat 8B is a library of open-source language models, fine-tuned with "C-RLFT (Conditioned Reinforcement Learning Fine-Tuning)" - a strategy inspired by offline reinforcement learning. It has been trained on mixed-quality data without preference labels.
It outperforms many similarly sized models including Llama 3 8B Instruct and various fine-tuned models. It excels in general conversation, coding assistance, and mathematical reasoning.
- For OpenChat fine-tuned on Mistral 7B, check out OpenChat 7B.
- For OpenChat fine-tuned on Llama 8B, check out OpenChat 8B.
#open-source