Perplexity: Llama 3.1 Sonar 8B Online

perplexity/llama-3.1-sonar-small-128k-online

Created Aug 1, 2024127,072 context
$0.20/M input tokens$0.20/M output tokens$5/K reqs

Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance.

This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

Providers for Llama 3.1 Sonar 8B Online

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Apps using Llama 3.1 Sonar 8B Online

Top public apps this week using this model

Recent activity on Llama 3.1 Sonar 8B Online

Tokens processed per day

Feb 7Feb 13Feb 19Feb 25Mar 3Mar 9Mar 15Mar 21Mar 27Apr 2Apr 8Apr 14Apr 20Apr 26May 2May 8020M40M60M80M

Versions by Token Share

Uptime stats for Llama 3.1 Sonar 8B Online

Uptime stats for Llama 3.1 Sonar 8B Online across all providers

When an error occurs in an upstream provider, we can recover by routing to another healthy provider, if your request filters allow it.

Learn more about our load balancing and customization options.

Sample code and API for Llama 3.1 Sonar 8B Online

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible completion API to 300+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

from openai import OpenAI

client = OpenAI(
  base_url="https://openrouter.ai/api/v1",
  api_key="<OPENROUTER_API_KEY>",
)

completion = client.chat.completions.create(
  extra_headers={
    "HTTP-Referer": "<YOUR_SITE_URL>", # Optional. Site URL for rankings on openrouter.ai.
    "X-Title": "<YOUR_SITE_NAME>", # Optional. Site title for rankings on openrouter.ai.
  },
  extra_body={},
  model="perplexity/llama-3.1-sonar-small-128k-online",
  messages=[
    {
      "role": "user",
      "content": "What is the meaning of life?"
    }
  ]
)
print(completion.choices[0].message.content)

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.

    Llama 3.1 Sonar 8B Online - API, Providers, Stats | OpenRouter