Microsoft: Phi-3 Medium 128K Instruct
microsoft/phi-3-medium-128k-instruct
Created May 24, 2024 | 128,000 context
$1/M input tokens | $1/M output tokens
Phi-3 Medium 128K is a 14-billion-parameter model designed for advanced language understanding, reasoning, and instruction following. Optimized through supervised fine-tuning and preference adjustments, it excels at tasks involving common sense, mathematics, logical reasoning, and code processing.
At the time of release, Phi-3 Medium demonstrated state-of-the-art performance among lightweight models. On the MMLU-Pro evaluation, it even came close to Llama 3 70B-level performance.
For a 4K context length, try Phi-3 Medium 4K.
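As a rough illustration of the listed pricing ($1 per million tokens for both input and output), the sketch below estimates the cost of a single request. The helper name `estimate_cost_usd` and the token counts are hypothetical, chosen only for this example; the rates are the ones shown on this page and may change.

```python
# Listed rates from this page, in USD per one million tokens.
INPUT_USD_PER_MILLION = 1.0
OUTPUT_USD_PER_MILLION = 1.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Example: a long prompt (120,000 tokens) with a short reply (2,000 tokens).
print(f"${estimate_cost_usd(120_000, 2_000):.3f}")  # prints "$0.122"
```

Because the input and output rates are equal here, the estimate depends only on the total token count; the two rates are kept separate so the sketch still works if they diverge.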