Will It Run AI
CalculatorModelsHardwareCompare
Product
  • Calculator
  • Compare
  • Tier List
Browse
  • Models
  • Hardware
  • Docs
About
  • Why It Works
  • What's New
  • Legal Notice
  • Privacy Policy

All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)

Home/Mixtral 8x7B/on RTX PRO 5000 Blackwell 48GB

Can it run?

Can RTX PRO 5000 Blackwell 48GB run Mixtral 8x7B?

BGood

Runs well

Using Q4_K_M in Ollama

Capabilities:

Fit status

Runs well

Decode

75.5 tok/s

TTFT

2563 ms

Safe context

21K

Memory

36.7 GB / 48.0 GB

Memory breakdown

Weights28.7 GB
KV Cache2.0 GB
Runtime1.2 GB
Headroom4.8 GB

Performance by workload

WorkloadGradeFitDecodeTTFTContext
Agentic CodingBRuns well75.5 tok/s3728 ms33K
ChatBRuns well75.5 tok/s1398 ms11K
CodingBRuns well75.5 tok/s2563 ms21K
RAGBRuns well75.5 tok/s4660 ms33K
ReasoningBRuns well75.5 tok/s3029 ms21K

Quantization options

How Mixtral 8x7B (47B params) fits at each quantization level on RTX PRO 5000 Blackwell 48GB (48.0 GB usable).

QuantBitsVRAMQualityFit
Q2_K
2
18.3 GB
LowD37
Q3_K_S
3
23.0 GB
LowD40
NVFP4
4
26.3 GB
MediumC41
Q4_K_M
4
28.7 GB
MediumC42
Q5_K_MBest for your GPU
5
33.8 GB
HighC44
Q6_K
6
38.5 GB
HighC44
Q8_0
8
50.3 GB
Very HighF0
F16
16
96.4 GB
MaximumF0

Get started

Ollama
ollama run mixtral-8x7b
HuggingFace
huggingface-cli download mixtral-8x7b
See all results for RTX PRO 5000 Blackwell 48GBSee all hardware for Mixtral 8x7B