Skip to content
  1.  
  2. © 2023 – 2025 OpenRouter, Inc

    Inception: Mercury

    inception/mercury

    Created Jun 26, 2025128,000 context
    $0.25/M input tokens$1/M output tokens

    Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.

    Recent activity on Mercury

    Total usage per day on OpenRouter