Mistral

Devstral Small 2 24B Instruct

Devstral Small family, 24B parameters, recommended as Q4_K_M for first-pass local usage in V1.

Params

24B

Context

256K

License

Mistral Research License

Best runtime

Ollama

Recommended hardware

First-pass fit across priority GPUs

Open calculator
HardwareFitDecodeSafe ctx
NVIDIA A10 24GBTight fit23.5 tok/s18K
NVIDIA A100 40GBRuns well84.5 tok/s28K
NVIDIA A100 80GBRuns well110.7 tok/s48K
NVIDIA A16 64GBRuns well32.6 tok/s41K