- OpenAI: GPT-4 Vision (preview)
Ability to understand images, in addition to all other GPT-4 Turbo capabilities. Training data: up to Apr 2023.
Note: heavily rate limited by OpenAI while in preview.
by openai | 128k context | $10.00/M input tkns | $30.00/M output tkns | $14.45 / 1K input images | 3.2M tokens this week
- OpenAI: GPT-3.5 Turbo 16k (preview)
The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021.
by openai | 16k context | $1.00/M input tkns | $2.00/M output tkns | 19.4M tokens this week
- OpenAI: GPT-4 Turbo (preview)
The latest GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Apr 2023.
Note: heavily rate limited by OpenAI while in preview.
by openai | 128k context | $10.00/M input tkns | $30.00/M output tkns | 82.3M tokens this week
- OpenAI: GPT-3.5 Turbo Instruct
This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.
by openai | 4k context | $1.50/M input tkns | $2.00/M output tkns | 5.2M tokens this week
- OpenAI: GPT-3.5 Turbo 16k
This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021.
by openai | 16k context | $3.00/M input tkns | $4.00/M output tkns | 34.8M tokens this week
- OpenAI: GPT-4 32k
GPT-4-32k is an extended version of GPT-4, with the same capabilities but quadrupled context length, allowing for processing up to 40 pages of text in a single pass. This is particularly beneficial for handling longer content like interacting with PDFs without an external vector database. Training data: up to Sep 2021.
by openai | 33k context | $60.00/M input tkns | $120.00/M output tkns | 23.4M tokens this week
- OpenAI: GPT-4 32k (older v0314)
GPT-4-32k is an extended version of GPT-4, with the same capabilities but quadrupled context length, allowing for processing up to 40 pages of text in a single pass. This is particularly beneficial for handling longer content like interacting with PDFs without an external vector database. Training data: up to Sep 2021.
by openai | 33k context | $60.00/M input tkns | $120.00/M output tkns | 1.3M tokens this week
- OpenAI: GPT-3.5 Turbo
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data: up to Sep 2021.
by openai | 4k context | $1.00/M input tkns | $2.00/M output tkns | 122.4M tokens this week
- OpenAI: GPT-3.5 Turbo (older v0301)
GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data: up to Sep 2021.
by openai | 4k context | $1.00/M input tkns | $2.00/M output tkns | 822.8K tokens this week
- OpenAI: GPT-4
OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning capabilities. Training data: up to Sep 2021.
by openai | 8k context | $30.00/M input tkns | $60.00/M output tkns | 71.8M tokens this week
- OpenAI: GPT-4 (older v0314)
GPT-4-0314 is the first version of GPT-4 released, with a context length of 8,192 tokens, and was supported until June 14. Training data: up to Sep 2021.
by openai | 8k context | $30.00/M input tkns | $60.00/M output tkns | 2.3M tokens this week
- OpenAI: Davinci 2
An InstructGPT model derived from the code-davinci-002 model, designed to follow instructions in prompts to provide detailed responses. Training data: up to Sep 2021.
by openai | 4k context | $20.00/M input tkns | $20.00/M output tkns | 21.7K tokens this week
- Llava 13B
LLaVA is a large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding, achieving impressive chat capabilities mimicking GPT-4 and setting a new state-of-the-art accuracy on Science QA.
by haotian-liu | 2k context | $5.00/M input tkns | $5.00/M output tkns | 621.8K tokens this week
- Auto (best for prompt)
Depending on their size, subject, and complexity, your prompts will be sent to MythoMax 13B, MythoMax 13B 8k or GPT-4 Turbo. To see which model was used, visit Activity.
by openrouter | 128k context
- Xwin 70B
Xwin-LM aims to develop and open-source alignment technology for LLMs. Our first release, built on the Llama 2 base models, ranked first on AlpacaEval. Notably, it is the first to surpass GPT-4 on this benchmark. The project will be continuously updated.
by xwin-lm | 8k context | $6.56/M input tkns | $6.56/M output tkns | 27.8M tokens this week
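
The per-million-token prices listed above can be turned into a rough per-request cost. The following is a minimal sketch, not part of any official SDK: the `PRICES` table and the `estimate_cost` helper are hypothetical names introduced here for illustration, the prices are copied from a few entries in this listing, and the token counts in the usage loop are made-up inputs. It ignores image pricing and any other fees.

```python
# Prices in USD per million tokens (input, output), copied from the listing above.
# The dictionary and function names are illustrative assumptions, not an official API.
PRICES = {
    "GPT-4 Turbo (preview)": (10.00, 30.00),
    "GPT-3.5 Turbo": (1.00, 2.00),
    "GPT-4": (30.00, 60.00),
    "GPT-4 32k": (60.00, 120.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given token counts."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example request size (assumed): 3,000 prompt tokens and 500 completion tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 3000, 500):.4f}")
```

Comparing the same request across models this way makes the price gap concrete: with these listed prices, the assumed 3,000-in / 500-out request costs thirty times more on GPT-4 than on GPT-3.5 Turbo.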