The o1 series of models is trained with reinforcement learning to think before answering and to perform complex reasoning. The o1-pro model uses more compute to think harder and provide consistently better answers.
OpenAI: o1-pro – Uptime and Availability | OpenRouter