- Meta: Llama 3 8B Instruct (extended)
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy. Note: this is an extended-context version of this model. It may have higher prices and different outputs.
by meta-llama16K context$0.2751/M input tkns$2.826/M output tkns12.9M tokens this week - Meta: Llama 3 8B Instruct (nitro)
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy. Note: this is a higher-throughput version of this model, and may have higher prices and slightly different outputs.
by meta-llama8.2K context$0.2/M input tkns$0.2/M output tkns49.3M tokens this week - Meta: Llama 3 70B Instruct (nitro)
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy. Note: this is a higher-throughput version of this model, and may have higher prices and slightly different outputs.
by meta-llama8.2K context$0.9/M input tkns$0.9/M output tkns416M tokens this week - Meta: Llama 3 70B Instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
by meta-llama8.2K context$0.81/M input tkns$0.81/M output tkns1.8B tokens this week - Meta: Llama 3 8B Instruct
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta's Acceptable Use Policy.
by meta-llama8.2K context$0.1/M input tkns$0.1/M output tkns2.16B tokens this week - Meta: Llama v2 70B Chat (nitro)
The flagship, 70 billion parameter language model from Meta, fine tuned for chat completions. Llama 2 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align to human preferences for helpfulness and safety. Note: this is a higher-throughput version of this model, and may have higher prices and slightly different outputs.
by meta-llama4.1K context$0.9/M input tkns$0.9/M output tkns1.93M tokens this week - Meta: CodeLlama 70B Instruct
Code Llama is a family of large language models for code. This one is based on Llama 2 70B and provides zero-shot instruction-following ability for programming tasks.
by codellama2K context$0.81/M input tkns$0.81/M output tkns3.48M tokens this week - Perplexity: PPLX 70B Online
The larger, internet-connected chat model by Perplexity Labs, based on Llama 2 70B. The online models are focused on delivering helpful, up-to-date, and factual responses. #online
by perplexity4.1K context$1/M input tkns$1/M output tkns$5/K reqs2.48M tokens this week - Perplexity: PPLX 70B Chat
The larger chat model by Perplexity Labs, with 70 billion parameters. Based on Llama 2 70B.
by perplexity4.1K context$1/M input tkns$1/M output tkns1.88M tokens this week - Psyfighter 13B
A #merge model based on Llama-2-13B and made possible thanks to the compute provided by the KoboldAI community. It's a merge between: - KoboldAI/LLaMA2-13B-Tiefighter - chaoyi-wu/MedLLaMA_13B - Doctor-Shotgun/llama-2-13b-chat-limarp-v2-merged. #merge
by jebcarter4.1K context - Xwin 70B
Xwin-LM aims to develop and open-source alignment tech for LLMs. Our first release, built-upon on the Llama2 base models, ranked TOP-1 on AlpacaEval. Notably, it's the first to surpass GPT-4 on this benchmark. The project will be continuously updated.
by xwin-lm8.2K context$3.75/M input tkns$3.75/M output tkns10.8M tokens this week - Meta: CodeLlama 34B Instruct
Code Llama is built upon Llama 2 and excels at filling in code, handling extensive input contexts, and folling programming instructions without prior training for various programming tasks.
by meta-llama8.2K context$0.72/M input tkns$0.72/M output tkns504K tokens this week - Meta: Llama v2 13B Chat
A 13 billion parameter language model from Meta, fine tuned for chat completions
by meta-llama4.1K context$0.13/M input tkns$0.13/M output tkns18.7M tokens this week - Meta: Llama v2 70B Chat
The flagship, 70 billion parameter language model from Meta, fine tuned for chat completions. Llama 2 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align to human preferences for helpfulness and safety.
by meta-llama4.1K context$0.64/M input tkns$0.8/M output tkns7.46M tokens this week