Search/
Skip to content
/

Sao10K: Llama 3 Stheno 8B v3.3 32K

sao10k/l3-stheno-8b

Released Jun 27, 202432,000 context

Stheno 8B 32K is a creative writing/roleplay model from Sao10k. It was trained at 8K context, then expanded to 32K context.

Compared to older Stheno version, this model is trained on:

  • 2x the amount of creative writing samples
  • Cleaned up roleplaying samples
  • Fewer low quality samples
OpenRouter
© 2026 OpenRouter, Inc

Product

  • Chat
  • Rankings
  • Models
  • Providers
  • Pricing
  • Redeem
  • Enterprise

Company

  • About
  • Announcements
  • CareersHiring
  • Privacy
  • Terms of Service
  • Support
  • State of AI
  • Works With OR

Developer

  • Documentation
  • API Reference
  • SDK
  • Status

Connect

  • Discord
  • GitHub
  • LinkedIn
  • X
  • YouTube

Sample code and API for Llama 3 Stheno 8B v3.3 32K

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible completion API to 300+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.