StripedHyena Hessian 7B (base)


Updated Dec 9
32,768 context
$0.18/M input tokens
$0.18/M output tokens

This is the base model variant of the StripedHyena series, developed by Together.

StripedHyena uses a hybrid architecture designed to compete with traditional Transformers, particularly on long-context inputs. It combines attention mechanisms with gated convolutions, aiming for faster and more efficient processing and better scaling as sequences grow longer. The model is positioned as an alternative architecture for sequence modeling tasks.
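
To make the term "gated convolution" concrete, here is a minimal pure-Python sketch of the general idea: a causal convolution mixes information along the sequence, and an elementwise gate decides how much of each mixed value passes through. This is an illustration only, not StripedHyena's implementation. The function names, the fixed two-tap filter, and the hand-written gate values are all hypothetical; in a real model the gate and filter would typically be learned or computed from the input.

```python
def causal_conv(signal, kernel):
    """Causal 1-D convolution: output[t] depends only on signal[0..t]."""
    out = []
    for t in range(len(signal)):
        acc = 0.0
        # Only look back as far as the kernel (and the sequence start) allows.
        for k in range(min(t + 1, len(kernel))):
            acc += kernel[k] * signal[t - k]
        out.append(acc)
    return out


def gated_conv(values, gate, kernel):
    """Multiply a causally convolved sequence by an elementwise gate."""
    mixed = causal_conv(values, kernel)
    return [g * m for g, m in zip(gate, mixed)]


# Toy inputs (assumed for illustration, not taken from the model).
values = [1.0, 2.0, 3.0, 4.0]
gate = [1.0, 0.0, 1.0, 0.5]   # 0.0 blocks a position, 1.0 passes it unchanged
kernel = [0.5, 0.5]           # averages the current and previous step

print(gated_conv(values, gate, kernel))  # [0.5, 0.0, 2.5, 1.75]
```

Unlike attention, whose cost grows with every pair of positions, a convolution of this kind applies the same filter at each step, which is one reason convolution-based layers are attractive for long sequences.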