Mancer

Browse models provided by Mancer (Terms of Service)

7 models

DeepSeek: DeepSeek V4 Flash 0731
DeepSeek V4 Flash 0731 is a sparse mixture-of-experts model from DeepSeek, with 13B active parameters out of 284B total. This re-post-trained revision is suited for coding, reasoning, and agent workflows.
by deepseek | Jul 31, 2026 | 1.05M context | $0.25/M input tokens | $1/M output tokens

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and high-throughput workloads, while maintaining strong reasoning and coding performance. The model includes hybrid attention for efficient long-context processing. Reasoning efforts high and xhigh are supported; xhigh maps to max reasoning. It is well suited for applications such as coding assistants, chat systems, and agent workflows where responsiveness and cost efficiency are important.

by deepseek | Apr 24, 2026 | 1.05M context | $0.20/M input tokens | $0.50/M output tokens

OpenAI: gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

by openai | Aug 5, 2025 | 131K context | $0.06/M input tokens | $0.50/M output tokens

Magnum v4 72B

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet (https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus (https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of Qwen2.5 72B.

by anthracite-org | Oct 22, 2024 | 33K context | $3/M input tokens | $5/M output tokens

Mancer: Weaver (alpha)

An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

by mancer | Aug 2, 2023 | 8K context | $0.50/M input tokens | $0.75/M output tokens

ReMM SLERP 13B

An experimental recreation of the original MythoMax-L2-B13 using updated models. #merge

by undi95 | Jul 22, 2023 | 4K context | $0.45/M input tokens | $0.65/M output tokens

MythoMax 13B

One of the highest-performing and most popular fine-tunes of Llama 2 13B, known for rich descriptions and roleplay. #merge

by gryphe | Jul 2, 2023 | 4K context | $0.45/M input tokens | $0.65/M output tokens
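
The per-million-token prices in this listing can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the `PRICES` table is copied by hand from the entries above, and `estimate_cost` is a hypothetical helper, not part of any OpenRouter or Mancer API. It assumes a request is billed purely as input tokens times the input price plus output tokens times the output price, which may not match actual billing.

```python
from decimal import Decimal

# USD per million tokens as (input, output), copied from the listing above.
# The dictionary keys are just the display names; they are not API model IDs.
PRICES = {
    "DeepSeek V4 Flash 0731": (Decimal("0.25"), Decimal("1")),
    "DeepSeek V4 Flash": (Decimal("0.20"), Decimal("0.50")),
    "gpt-oss-120b": (Decimal("0.06"), Decimal("0.50")),
    "Magnum v4 72B": (Decimal("3"), Decimal("5")),
    "Weaver (alpha)": (Decimal("0.50"), Decimal("0.75")),
    "ReMM SLERP 13B": (Decimal("0.45"), Decimal("0.65")),
    "MythoMax 13B": (Decimal("0.45"), Decimal("0.65")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    price_in, price_out = PRICES[model]
    return (price_in * input_tokens + price_out * output_tokens) / MILLION


if __name__ == "__main__":
    # Compare the same 2,000-token prompt and 500-token reply across all models.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2000, 500)}")
```

Decimal arithmetic is used instead of floats so that small per-token amounts do not pick up binary rounding noise when summed over many requests.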