Models
- Bagel 34B v0.2
An experimental fine-tune of Yi 34b 200k using bagel. This is the version of the fine-tune before direct preference optimization (DPO) has been applied. DPO performs better on benchmarks, but this version is likely better for creative writing, roleplay, etc.
by jondurbin8K context - Yi 34B Chat
The Yi series models are large language models trained from scratch by developers at 01.AI. This version is instruct-tuned to work better for chat.
by 01-ai4K context$0.72/M input tkns$0.72/M output tkns1.82M tokens this week - Yi 34B (base)
The Yi series models are large language models trained from scratch by developers at 01.AI.
by 01-ai4K context$0.72/M input tkns$0.72/M output tkns2K tokens this week - Yi 6B (base)
The Yi series models are large language models trained from scratch by developers at 01.AI.
by 01-ai4K context$0.126/M input tkns$0.126/M output tkns2K tokens this week