Skip to content
  1.  
  2. © 2023 – 2025 OpenRouter, Inc
    Favicon for Mancer 2

    Mancer (private)

    Browse models provided by Mancer (private) (Terms of Service)

    9 models

    Tokens processed on OpenRouter

    • Z.AI: GLM 4.6GLM 4.6

      Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

      by z-ai
    200K context
    $0.45/M input tokens$2/M output tokens
  3. Z.AI: GLM 4.5GLM 4.5

    GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

    by z-ai131K context$0.35/M input tokens$2/M output tokens
  4. Magnum v4 72BMagnum v4 72B

    This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of Qwen2.5 72B.

    by anthracite-org33K context$3/M input tokens$5/M output tokens
  5. NeverSleep: Lumimaid v0.2 8BLumimaid v0.2 8B

    Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Usage of this model is subject to Meta's Acceptable Use Policy.

    by neversleep131K context$0.10/M input tokens$0.50/M output tokens
  6. Noromaid 20BNoromaid 20B

    A collab between IkariDev and Undi. This merge is suitable for RP, ERP, and general knowledge. #merge #uncensored

    by neversleep8K context$1/M input tokens$2/M output tokens
  7. Goliath 120BGoliath 120B

    A large LLM created by combining two fine-tuned Llama 70B models into one 120B model. Combines Xwin and Euryale. Credits to - @chargoddard for developing the framework used to merge the model - mergekit. - @Undi95 for helping with the merge ratios. #merge

    by alpindale6K context$6/M input tokens$8/M output tokens
  8. Mancer: Weaver (alpha)Weaver (alpha)

    An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

    by mancer8K context$1.125/M input tokens$1.125/M output tokens
  9. ReMM SLERP 13BReMM SLERP 13B

    A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge

    by undi954K context$0.50/M input tokens$0.75/M output tokens
  10. MythoMax 13BMythoMax 13B

    One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

    by gryphe4K context$0.50/M input tokens$0.75/M output tokens