DeepSeek-V2 Chat
deepseek/deepseek-chat
Updated May 14 · 128,000 context
$0.14/M input tokens · $0.28/M output tokens
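For a concrete sense of what these per-million-token rates mean per request, here is a minimal sketch (in Python, with hypothetical constant and function names) that converts a request's token counts into dollars at the listed prices:

```python
# Listed rates for deepseek/deepseek-chat (USD per 1M tokens).
INPUT_PER_M = 0.14   # prompt tokens
OUTPUT_PER_M = 0.28  # completion tokens

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed rates."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# e.g. a 2,000-token prompt with a 500-token reply:
print(f"${request_cost(2_000, 500):.6f}")  # $0.000420
```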
DeepSeek-V2 Chat is a conversational finetune of DeepSeek-V2, a Mixture-of-Experts (MoE) language model. It comprises 236B total parameters, of which 21B are activated for each token.
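The card doesn't spell out DeepSeek-V2's routing scheme, so the following is only a minimal sketch of the generic top-k MoE pattern behind that total-vs-activated split: every expert's weights count toward the 236B total, but a learned router runs just a few experts per token, so only about 21B parameters do work on any single token. All class names and dimensions below are illustrative, not DeepSeek-V2's actual architecture.

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    """Illustrative top-k Mixture-of-Experts layer. All experts' weights
    count toward the total parameter count, but only k experts are
    activated for each token."""

    def __init__(self, d_model: int, n_experts: int, k: int):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, 4 * d_model),
                          nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        ])
        self.k = k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model). Score every expert for every token,
        # then keep only the k best-scoring experts per token.
        weights = torch.softmax(self.router(x), dim=-1)
        topk_w, topk_idx = weights.topk(self.k, dim=-1)
        topk_w = topk_w / topk_w.sum(dim=-1, keepdim=True)  # renormalize

        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = topk_idx[:, slot] == e          # tokens routed here
                if mask.any():
                    out[mask] += topk_w[mask, slot, None] * expert(x[mask])
        return out

# Usage: 8 experts in total, but each token passes through only 2 of them.
layer = TopKMoE(d_model=64, n_experts=8, k=2)
tokens = torch.randn(10, 64)
print(layer(tokens).shape)  # torch.Size([10, 64])
```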
Compared with DeepSeek 67B, DeepSeek-V2 achieves stronger performance while cutting training costs by 42.5%, reducing the KV cache by 93.3%, and boosting maximum generation throughput by 5.76x.
It also performs strongly on both standard benchmarks and open-ended generation evaluations.