NeverSleep: Lumimaid v0.2 8B
neversleep/llama-3.1-lumimaid-8b
Created Sep 15, 202432,768 context
$0.1875/M input tokens$1.125/M output tokens
Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged.
Usage of this model is subject to Meta's Acceptable Use Policy.
Providers for Lumimaid v0.2 8B
OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.
fp16 | Context 33K Max Output 2K Input $0.1875 Output $1.125 Latency 0.94s Throughput 34.96t/s |
fp16 | Context 33K Max Output 2K Input $0.25 Output $1.5 Latency 0.68s Throughput 35.45t/s |
fp8 | Context 16K Max Output 4K Input $0.8 Output $1.2 Latency 1.70s Throughput 25.90t/s |