Microsoft: Phi-3.5 Mini 128K Instruct

microsoft/phi-3.5-mini-128k-instruct

Phi-3.5 models are lightweight, state-of-the-art open models. These models were trained with Phi-3 datasets that include both synthetic data and the filtered, publicly available websites data, with a focus on high quality and reasoning-dense properties. Phi-3.5 Mini uses 3.8B parameters, and is a dense decoder-only transformer model using the same tokenizer as Phi-3 Mini.

The models underwent a rigorous enhancement process, incorporating both supervised fine-tuning, proximal policy optimization, and direct preference optimization to ensure precise instruction adherence and robust safety measures. When assessed against benchmarks that test common sense, language understanding, math, code, long context and logical reasoning, Phi-3.5 models showcased robust and state-of-the-art performance among models with less than 13 billion parameters.

Modalities

Context

128K

Released

Aug 21, 2024

Recent activity on Phi-3.5 Mini 128K Instruct

Total usage per day on OpenRouter

Not enough data to display yet.

Microsoft: Phi-3.5 Mini 128K Instruct