Will It Run AI
CalculatorModelsHardwareCompare
Product
  • Calculator
  • Compare
  • Tier List
Browse
  • Models
  • Hardware
  • Docs
About
  • Why It Works
  • What's New
  • Legal Notice
  • Privacy Policy

All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)

Home/Qwen3.5 27B/on Mac Studio M2 Ultra 128GB

Can it run?

Can Mac Studio M2 Ultra 128GB run Qwen3.5 27B?

CUsable

Runs well

Using Q4_K_M in Ollama

Capabilities:

Fit status

Runs well

Decode

28.2 tok/s

TTFT

6872 ms

Safe context

41K

Memory

35.7 GB / 92.2 GB

Memory breakdown

Weights16.5 GB
KV Cache4.2 GB
Runtime1.2 GB
Headroom13.8 GB

Performance by workload

WorkloadGradeFitDecodeTTFTContext
Agentic CodingCRuns well28.2 tok/s9996 ms74K
ChatCRuns well28.2 tok/s3748 ms22K
CodingCRuns well28.2 tok/s6872 ms41K
RAGCRuns well28.2 tok/s12494 ms74K
ReasoningCRuns well28.2 tok/s8121 ms41K

Quantization options

How Qwen3.5 27B (27B params) fits at each quantization level on Mac Studio M2 Ultra 128GB (92.2 GB usable).

QuantBitsVRAMQualityFit
Q2_K
2
10.5 GB
LowD32
Q3_K_S
3
13.2 GB
LowD33
NVFP4
4
15.1 GB
MediumD33
Q4_K_M
4
16.5 GB
MediumD33
Q5_K_M
5
19.4 GB
HighD34
Q6_K
6
22.1 GB
HighD35
Q8_0
8
28.9 GB
Very HighD36
F16Best for your GPU
16
55.4 GB
MaximumC42

Get started

HuggingFace
huggingface-cli download hf-unsloth--qwen3-5-27b-gguf

Upgrade options

Hardware that runs Qwen3.5 27B well

AMDAMD Instinct MI250X 128GBBudget pick
C151.5 tok/s decode

~$15,000 MSRP

AMDAMD Instinct MI300X 192GBBest value
C251 tok/s decode

~$15,000 MSRP

NVIDIANVIDIA GH200 96GBBiggest leap
C196.7 tok/s decode

 

See all results for Mac Studio M2 Ultra 128GBSee all hardware for Qwen3.5 27B