Skip to content
  •  
  • © 2023 – 2025 OpenRouter, Inc
      Favicon for Infermatic

      Infermatic

      Browse models provided by Infermatic (Terms of Service)

      8 models

      Tokens processed

      • Sao10K: Llama 3.3 Euryale 70B

        Euryale L3.3 70B is a model focused on creative roleplay from Sao10k. It is the successor of Euryale L3 70B v2.2.

        by sao10k8K context$1.50/M input tokens$1.50/M output tokens
      • SorcererLM 8x22B

        SorcererLM is an advanced RP and storytelling model, built as a Low-rank 16-bit LoRA fine-tuned on WizardLM-2 8x22B. - Advanced reasoning and emotional intelligence for engaging and immersive interactions - Vivid writing capabilities enriched with spatial and contextual awareness - Enhanced narrative depth, promoting creative and dynamic storytelling

        by raifle16K context$4.50/M input tokens$4.50/M output tokens
      • Unslopnemo 12B

        UnslopNemo v4.1 is the latest addition from the creator of Rocinante, designed for adventure writing and role-play scenarios.

        by thedrummer32K context$0.50/M input tokens$0.50/M output tokens
      • Magnum v4 72B

        This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of Qwen2.5 72B.

        by anthracite-org33K context$3/M input tokens$3/M output tokens
      • NVIDIA: Llama 3.1 Nemotron 70B Instruct

        NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging Llama 3.1 70B architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to Meta's Acceptable Use Policy.

        by nvidia131K context$1/M input tokens$1/M output tokens
      • Magnum v2 72B

        From the maker of Goliath, Magnum 72B is the seventh in a family of models designed to achieve the prose quality of the Claude 3 models, notably Opus & Sonnet. The model is based on Qwen2 72B and trained with 55 million tokens of highly curated roleplay (RP) data.

        by anthracite-org33K context$3/M input tokens$3/M output tokens
      • Rocinante 12B

        Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported: - Expanded vocabulary with unique and expressive word choices - Enhanced creativity for vivid narratives - Adventure-filled and captivating stories

        by thedrummer33K context$0.25/M input tokens$0.50/M output tokens
      • Sao10K: Llama 3.1 Euryale 70B v2.2

        Euryale L3.1 70B v2.2 is a model focused on creative roleplay from Sao10k. It is the successor of Euryale L3 70B v2.1.

        by sao10k131K context$1.50/M input tokens$1.50/M output tokens