Will It Run AI
CalculatorModelsHardwareCompare
Product
  • Calculator
  • Compare
  • Tier List
Browse
  • Models
  • Hardware
  • Docs
About
  • Why It Works
  • What's New
  • Legal Notice
  • Privacy Policy

All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)

Home/Devstral 2 123B Instruct/on AMD Instinct MI325X 256GB

Can it run?

Can AMD Instinct MI325X 256GB run Devstral 2 123B Instruct?

CUsable

Runs well

Using Q4_K_M in vLLM

Capabilities:

Fit status

Runs well

Decode

58.4 tok/s

TTFT

3316 ms

Safe context

34K

Memory

122.2 GB / 256.0 GB

Memory breakdown

Weights75.0 GB
KV Cache19.2 GB
Runtime2.4 GB
Headroom25.6 GB

Performance by workload

WorkloadGradeFitDecodeTTFTContext
Agentic CodingCRuns well58.4 tok/s4824 ms58K
ChatCRuns well58.4 tok/s1809 ms18K
CodingCRuns well58.4 tok/s3316 ms34K
RAGCRuns well58.4 tok/s6030 ms58K
ReasoningCRuns well58.4 tok/s3919 ms34K

Quantization options

How Devstral 2 123B Instruct (123B params) fits at each quantization level on AMD Instinct MI325X 256GB (256.0 GB usable).

QuantBitsVRAMQualityFit
Q2_K
2
48.0 GB
LowD34
Q3_K_S
3
60.3 GB
LowD35
NVFP4
4
68.9 GB
MediumD35
Q4_K_M
4
75.0 GB
MediumD36
Q5_K_M
5
88.6 GB
HighD37
Q6_K
6
100.9 GB
HighD38
Q8_0Best for your GPU
8
131.6 GB
Very HighC41
F16
16
252.2 GB
MaximumC45

Get started

HuggingFace
huggingface-cli download devstral-2-123b

Upgrade options

Hardware that runs Devstral 2 123B Instruct well

AMDAMD Instinct MI350X 288GBBudget pick
C77.8 tok/s decode

~$8,000 MSRP

See all results for AMD Instinct MI325X 256GBSee all hardware for Devstral 2 123B Instruct