Stheno 8B 32K is a creative writing/roleplay model from Sao10k. It was trained at 8K context, then expanded to 32K context.
Compared to the older Stheno version, this model was trained on:
2x the amount of creative writing samples
Cleaned up roleplaying samples
Fewer low-quality samples
Sample code and API for Llama 3 Stheno 8B v3.3 32K
OpenRouter normalizes requests and responses across providers for you.
To get started, you can use Llama 3 Stheno 8B v3.3 32K via API like this:
fetch("https://openrouter.ai/api/v1/chat/completions", {
  method: "POST",
  headers: {
    "Authorization": `Bearer ${OPENROUTER_API_KEY}`,
    "HTTP-Referer": `${YOUR_SITE_URL}`, // Optional, for including your app on openrouter.ai rankings.
    "X-Title": `${YOUR_SITE_NAME}`, // Optional. Shows in rankings on openrouter.ai.
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    "model": "sao10k/l3-stheno-8b",
    "messages": [
      {"role": "user", "content": "What is the meaning of life?"},
    ],
  })
});
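The fetch call above returns a promise and does not inspect the result. One possible way to structure it is sketched here: a builder that assembles the request, and a sender that checks the HTTP status before parsing. `buildChatRequest` and `sendChat` are hypothetical helper names, not part of any OpenRouter library, and the sketch assumes a runtime with a global `fetch` (a browser or a recent Node.js).

```javascript
// Hypothetical helper: assembles the URL and fetch options shown above.
// The optional ranking headers (HTTP-Referer, X-Title) are omitted here.
function buildChatRequest(apiKey, userContent, model = "sao10k/l3-stheno-8b") {
  return {
    url: "https://openrouter.ai/api/v1/chat/completions",
    options: {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: userContent }],
      }),
    },
  };
}

// Hypothetical helper: sends the request and returns the parsed JSON body.
// Treats any non-2xx status as an error instead of parsing it silently.
async function sendChat(apiKey, userContent) {
  const { url, options } = buildChatRequest(apiKey, userContent);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Keeping the builder separate from the network call means the request shape can be checked without sending anything.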
You can also use OpenRouter with OpenAI's client API:
import OpenAI from "openai"

const openai = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: $OPENROUTER_API_KEY,
  defaultHeaders: {
    "HTTP-Referer": $YOUR_SITE_URL, // Optional, for including your app on openrouter.ai rankings.
    "X-Title": $YOUR_SITE_NAME, // Optional. Shows in rankings on openrouter.ai.
  }
})

async function main() {
  const completion = await openai.chat.completions.create({
    model: "sao10k/l3-stheno-8b",
    messages: [{ role: "user", content: "Say this is a test" }],
  })
  console.log(completion.choices[0].message)
}

main()
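The example above logs the whole message object. If only the reply text is needed, a small accessor along these lines may help. `extractReplyText` is a hypothetical name, and the sketch assumes the response follows the chat completion shape used above (`choices[0].message`), with the text in the message's `content` field.

```javascript
// Hypothetical helper: returns the first choice's text, or null when the
// response has no choices or the message carries no content.
function extractReplyText(completion) {
  const first = completion?.choices?.[0];
  return first?.message?.content ?? null;
}
```

Returning null instead of throwing lets the caller decide how to handle an empty response.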
See the Request docs for all possible parameters, and Parameters for recommended values.
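As a rough illustration of passing parameters, the sketch below merges optional sampling settings into a request body. `withParams` is a hypothetical helper, and the values shown for `temperature` and `max_tokens` are placeholders chosen for the example, not the recommended values referred to above.

```javascript
// Hypothetical helper: returns a new request body with extra parameters
// merged in, leaving the original object unchanged.
function withParams(baseBody, params = {}) {
  return { ...baseBody, ...params };
}

const baseBody = {
  model: "sao10k/l3-stheno-8b",
  messages: [{ role: "user", content: "What is the meaning of life?" }],
};

// Illustrative values only; consult the Parameters page for real guidance.
const requestBody = withParams(baseBody, { temperature: 0.8, max_tokens: 256 });
```

The resulting `requestBody` can be passed to `JSON.stringify` in the fetch example or used as the argument to `openai.chat.completions.create`.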