Mistral's official instruct fine-tuned version of Mixtral 8x22B. It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include:
- strong math, coding, and reasoning
- large context length (64k)
- fluency in English, French, Italian, German, and Spanish
See benchmarks on the launch announcement here(opens in new tab). #moe
Modalities
In / Out Price
$2 / $6per 1M
Context
66K
Weekly Rank
#277on OpenRouter
Knowledge Cutoff
Jan 31, 2024
