Mistral

Mistral Small 4 119B

Mistral Small family, 119B parameters, recommended as Q4_K_M for first-pass local usage in V1.

Params

119B

Context

256K

License

Mistral Research License

Best runtime

vLLM

Recommended hardware

First-pass fit across priority GPUs

Open calculator
HardwareFitDecodeSafe ctx
NVIDIA A10 24GBToo heavy10.9 tok/s5K
NVIDIA A100 40GBToo heavy28.3 tok/s8K
NVIDIA A100 80GBRuns with offload37.2 tok/s16K
NVIDIA A16 64GBToo heavy10.9 tok/s13K