  2. © 2023 – 2025 OpenRouter, Inc

    Perplexity

    Browse models from Perplexity

    13 models

    Tokens processed on OpenRouter

    • Perplexity: Sonar Pro Search
      598K tokens

      Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system, designed for deeper reasoning and analysis. Pricing is based on tokens plus $18 per thousand requests. This model powers the Pro Search mode on the Perplexity platform. Sonar Pro Search adds autonomous, multi-step reasoning to Sonar Pro: instead of a single query followed by synthesis, it plans and executes entire research workflows using tools.

      by perplexity | 200K context | $3/M input tokens | $15/M output tokens | $18/K reqs
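The pricing rule for Sonar Pro Search (tokens plus a per-request fee) can be sketched as a small calculation. This is an illustrative estimate only: the rates are copied from the listing above, and the function and constant names are hypothetical rather than part of any OpenRouter or Perplexity API.

```python
# Rates as shown in the listing for Sonar Pro Search.
INPUT_USD_PER_M_TOKENS = 3.00    # $3 per million input tokens
OUTPUT_USD_PER_M_TOKENS = 15.00  # $15 per million output tokens
USD_PER_K_REQUESTS = 18.00       # $18 per thousand requests


def sonar_pro_search_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Estimate cost in USD (hypothetical helper, not an official API)."""
    token_cost = (
        input_tokens / 1_000_000 * INPUT_USD_PER_M_TOKENS
        + output_tokens / 1_000_000 * OUTPUT_USD_PER_M_TOKENS
    )
    request_cost = requests / 1_000 * USD_PER_K_REQUESTS
    return token_cost + request_cost


if __name__ == "__main__":
    # One request with 2,000 input tokens and 1,000 output tokens.
    print(f"${sonar_pro_search_cost(2_000, 1_000):.3f}")
```

For short exchanges the fixed fee of $0.018 per request makes up a large share of the total, so request count matters as much as token volume when budgeting.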
  3. Perplexity: Sonar Reasoning Pro
    18.7M tokens

    Note: Sonar Pro pricing includes Perplexity search pricing. Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for advanced use cases, it supports in-depth, multi-step queries with a larger context window and can surface more citations per search, enabling more comprehensive and extensible responses.

    by perplexity | 128K context | $2/M input tokens | $8/M output tokens
  4. Perplexity: Sonar Pro
    31.7M tokens

    Note: Sonar Pro pricing includes Perplexity search pricing. For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, such as, on average, twice as many citations per search as Sonar. With its larger context window, it can also handle longer and more nuanced searches and follow-up questions.

    by perplexity | 200K context | $3/M input tokens | $15/M output tokens
  5. Perplexity: Sonar Deep Research
    84.5M tokens

    Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers information. This enables comprehensive report generation across domains like finance, technology, health, and current events.

    Notes on pricing:
    - Input tokens comprise prompt tokens (the user prompt) plus citation tokens (tokens processed from running searches).
    - Deep Research runs multiple searches to conduct exhaustive research. Searches are priced at $5 per 1,000 searches, so a request that performs 30 searches costs $0.15 for this step.
    - Reasoning is a distinct step in Deep Research, because the model reasons extensively over all the material it gathers during its research phase. These reasoning tokens differ from the chain of thought (CoT) in the answer: they are used to work through the research material before the output is generated. Reasoning tokens are priced at $3 per 1M tokens.

    by perplexity | 128K context | $2/M input tokens | $8/M output tokens
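The Deep Research pricing notes combine four separate charges, which is easier to follow as a worked breakdown. The sketch below is a rough estimate under the rates quoted in this entry; the `DeepResearchUsage` class and `deep_research_cost` function are hypothetical names introduced for illustration, and actual billing may differ.

```python
from dataclasses import dataclass

# Rates quoted in the Sonar Deep Research entry.
INPUT_USD_PER_M = 2.00        # prompt + citation tokens
OUTPUT_USD_PER_M = 8.00       # output tokens
REASONING_USD_PER_M = 3.00    # reasoning tokens
USD_PER_K_SEARCHES = 5.00     # searches


@dataclass
class DeepResearchUsage:
    """Usage counts for one request (hypothetical container)."""
    prompt_tokens: int
    citation_tokens: int
    output_tokens: int
    reasoning_tokens: int
    searches: int


def deep_research_cost(usage: DeepResearchUsage) -> dict:
    """Return a per-component cost breakdown in USD, plus the total."""
    # Input tokens are prompt tokens plus citation tokens.
    input_tokens = usage.prompt_tokens + usage.citation_tokens
    parts = {
        "input": input_tokens / 1_000_000 * INPUT_USD_PER_M,
        "output": usage.output_tokens / 1_000_000 * OUTPUT_USD_PER_M,
        "reasoning": usage.reasoning_tokens / 1_000_000 * REASONING_USD_PER_M,
        "searches": usage.searches / 1_000 * USD_PER_K_SEARCHES,
    }
    parts["total"] = sum(parts.values())
    return parts


if __name__ == "__main__":
    # Made-up usage figures; only the 30 searches come from the entry's example.
    usage = DeepResearchUsage(500, 200_000, 10_000, 300_000, 30)
    for name, usd in deep_research_cost(usage).items():
        print(f"{name:>9}: ${usd:.3f}")
```

With these made-up token counts, the search step reproduces the $0.15 figure from the notes, while citation and reasoning tokens account for most of the remainder.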
  6. Perplexity: R1 1776

    R1 1776 is a version of DeepSeek-R1 that has been post-trained to remove censorship constraints related to topics restricted by the Chinese government. The model retains its original reasoning capabilities while providing direct responses to a wider range of queries. R1 1776 is an offline chat model that does not use the Perplexity search subsystem. The model was tested on a multilingual dataset of over 1,000 examples covering sensitive topics to measure its likelihood of refusal or overly filtered responses. Its performance on math and reasoning benchmarks remains similar to that of the base R1 model.

    by perplexity | 128K context
  7. Perplexity: Sonar Reasoning
    8.59M tokens

    Sonar Reasoning is a reasoning model provided by Perplexity and based on DeepSeek R1. It lets developers use long chain-of-thought reasoning with built-in web search. Sonar Reasoning is uncensored and hosted in US datacenters.

    by perplexity | 127K context | $1/M input tokens | $5/M output tokens | $5/K reqs
  8. Perplexity: Sonar
    36.6M tokens

    Sonar is lightweight, affordable, fast, and simple to use, and now features citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features optimized for speed.

    by perplexity | 127K context | $1/M input tokens | $1/M output tokens | $5/K reqs
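Because the priced Sonar models mix per-token and per-request charges, the cheapest option depends on the workload. The sketch below ranks them for a given request volume, assuming only the rates shown in this listing. Where the listing shows no per-request fee, the sketch treats it as zero, which may understate real cost since the entries note that pricing includes Perplexity search pricing. Sonar Deep Research is left out because it also bills searches and reasoning tokens. The `PRICES` table and `rank_models` function are hypothetical names.

```python
# (input $/M tokens, output $/M tokens, $/K requests) as shown in the listing.
# A request fee of 0.0 means the listing shows none (an assumption, see above).
PRICES = {
    "Sonar Pro Search": (3.0, 15.0, 18.0),
    "Sonar Reasoning Pro": (2.0, 8.0, 0.0),
    "Sonar Pro": (3.0, 15.0, 0.0),
    "Sonar Reasoning": (1.0, 5.0, 5.0),
    "Sonar": (1.0, 1.0, 5.0),
}


def rank_models(requests: int, input_per_request: int, output_per_request: int):
    """Return (model, estimated USD) pairs sorted from cheapest to priciest."""
    total_in = requests * input_per_request
    total_out = requests * output_per_request
    costs = []
    for model, (in_rate, out_rate, req_rate) in PRICES.items():
        usd = (
            total_in / 1_000_000 * in_rate
            + total_out / 1_000_000 * out_rate
            + requests / 1_000 * req_rate
        )
        costs.append((model, usd))
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # 1,000 requests, each with 1,000 input tokens and 500 output tokens.
    for model, usd in rank_models(1_000, 1_000, 500):
        print(f"{model:<20} ${usd:.2f}")
```

Under these listed rates, models with low token prices but a per-request fee are not always the cheapest for short exchanges, so it is worth running the comparison with realistic token counts.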
  9. Perplexity: Llama 3.1 Sonar 8B Online

    Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

    by perplexity | 127K context
  10. Perplexity: Llama 3.1 Sonar 70B Online

    Llama 3.1 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

    by perplexity | 127K context
  11. Perplexity: Llama3 Sonar 8B Online

    Llama3 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

    by perplexity | 28K context
  12. Perplexity: Llama3 Sonar 8B

    Llama3 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is a normal offline LLM, but the online version of this model has Internet access.

    by perplexity | 33K context
  13. Perplexity: Llama3 Sonar 70B Online

    Llama3 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is the online version of the offline chat model. It is focused on delivering helpful, up-to-date, and factual responses. #online

    by perplexity | 28K context
  14. Perplexity: Llama3 Sonar 70B

    Llama3 Sonar is Perplexity's latest model family. It surpasses their earlier Sonar models in cost-efficiency, speed, and performance. This is a normal offline LLM, but the online version of this model has Internet access.

    by perplexity | 33K context