- OpenAI: GPT-4o
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot" #multimodal
by openai128K context$5/M input tkns$15/M output tkns$2.312/K input imgs37.6M tokens this week - OpenAI: GPT-4o (2024-05-13)
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot" #multimodal
by openai128K context$5/M input tkns$15/M output tkns$2.312/K input imgs6.7M tokens this week - OpenAI: GPT-4 Turbo
The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to Dec 2023. This model is updated by OpenAI to point to the latest version of GPT-4 Turbo, currently gpt-4-turbo-2024-04-09 (as of April 2024).
by openai128K context$10/M input tkns$30/M output tkns$14.45/K input imgs17.8M tokens this week - OpenAI: GPT-3.5 Turbo 16k
The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021. This version has a higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
by openai16K context$0.5/M input tkns$1.5/M output tkns32.7M tokens this week - OpenAI: GPT-3.5 Turbo (older v0613)
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Updated by OpenAI to point to the latest version of GPT-3.5. Training data up to Sep 2021.
by openai4K context$1/M input tkns$2/M output tkns132K tokens this week - OpenAI: GPT-4 Turbo Preview
The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. Note: heavily rate limited by OpenAI while in preview.
by openai128K context$10/M input tkns$30/M output tkns1.18M tokens this week - OpenAI: GPT-4 Vision
Ability to understand images, in addition to all other GPT-4 Turbo capabilties. Training data: up to Apr 2023. Note: heavily rate limited by OpenAI while in preview. #multimodal
by openai128K context$10/M input tkns$30/M output tkns$14.45/K input imgs731K tokens this week - OpenAI: GPT-3.5 Turbo 16k (older v1106)
The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021.
by openai16K context$1/M input tkns$2/M output tkns5.74M tokens this week - OpenAI: GPT-4 Turbo (older v1106)
The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Apr 2023. Note: heavily rate limited by OpenAI while in preview.
by openai128K context$10/M input tkns$30/M output tkns8.95M tokens this week - OpenAI: GPT-3.5 Turbo Instruct
This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.
by openai4K context$1.5/M input tkns$2/M output tkns74K tokens this week - OpenAI: GPT-3.5 Turbo 16k
This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021.
by openai16K context$3/M input tkns$4/M output tkns1.62M tokens this week - OpenAI: GPT-4 32k
GPT-4-32k is an extended version of GPT-4, with the same capabilities but quadrupled context length, allowing for processing up to 40 pages of text in a single pass. This is particularly beneficial for handling longer content like interacting with PDFs without an external vector database. Training data: up to Sep 2021.
by openai33K context$60/M input tkns$120/M output tkns455K tokens this week - OpenAI: GPT-4 32k (older v0314)
GPT-4-32k is an extended version of GPT-4, with the same capabilities but quadrupled context length, allowing for processing up to 40 pages of text in a single pass. This is particularly beneficial for handling longer content like interacting with PDFs without an external vector database. Training data: up to Sep 2021.
by openai33K context$60/M input tkns$120/M output tkns136K tokens this week - OpenAI: GPT-3.5 Turbo
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Updated by OpenAI to point to the latest version of GPT-3.5. Training data up to Sep 2021.
by openai16K context$0.5/M input tkns$1.5/M output tkns10.6M tokens this week - OpenAI: GPT-3.5 Turbo (older v0301)
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Updated by OpenAI to point to the latest version of GPT-3.5. Training data up to Sep 2021.
by openai4K context$1/M input tkns$2/M output tkns1.2M tokens this week - OpenAI: GPT-4
OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning capabilities. Training data: up to Sep 2021.
by openai8K context$30/M input tkns$60/M output tkns2.07M tokens this week - OpenAI: GPT-4 (older v0314)
GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data: up to Sep 2021.
by openai8K context$30/M input tkns$60/M output tkns111K tokens this week - Nous: Hermes 2 Mixtral 8x7B DPO
Nous Hermes 2 Mixtral 8x7B DPO is the new flagship Nous Research model trained over the Mixtral 8x7B MoE LLM. The model was trained on over 1,000,000 entries of primarily GPT-4 generated data, as well as other high quality data from open datasets across the AI landscape, achieving state of the art performance on a variety of tasks. #moe
by nousresearch33K context$0.27/M input tkns$0.27/M output tkns2.48M tokens this week - LLaVA 13B
LLaVA is a large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities mimicking GPT-4 and setting a new state-of-the-art accuracy on Science QA #multimodal
by liuhaotian2K context$10/M input tkns$10/M output tkns - Auto (best for prompt)
Depending on their size, subject, and complexity, your prompts will be sent to Mistral Large, Claude 3 Sonnet or GPT-4o. To see which model was used, visit Activity.
by openrouter200K context - Xwin 70B
Xwin-LM aims to develop and open-source alignment tech for LLMs. Our first release, built-upon on the Llama2 base models, ranked TOP-1 on AlpacaEval. Notably, it's the first to surpass GPT-4 on this benchmark. The project will be continuously updated.
by xwin-lm8K context$3.75/M input tkns$3.75/M output tkns647K tokens this week