Will It Run AI
CalculatorModelsHardwareCompare
Product
  • Calculator
  • Compare
  • Tier List
Browse
  • Models
  • Hardware
  • Docs
About
  • Why It Works
  • What's New
  • Legal Notice
  • Privacy Policy

All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)

Home/Qwen3.5 35B A3B/on MacBook Pro M4 Max 128GB

Can it run?

Can MacBook Pro M4 Max 128GB run Qwen3.5 35B A3B?

CUsable

Runs well

Using Q4_K_M in Ollama

Capabilities:

Fit status

Runs well

Decode

16.1 tok/s

TTFT

12016 ms

Safe context

35K

Memory

41.8 GB / 92.2 GB

Memory breakdown

Weights21.3 GB
KV Cache5.5 GB
Runtime1.2 GB
Headroom13.8 GB

Performance by workload

WorkloadGradeFitDecodeTTFTContext
Agentic CodingCRuns well16.1 tok/s17478 ms62K
ChatCRuns well16.1 tok/s6554 ms19K
CodingCRuns well16.1 tok/s12016 ms35K
RAGCRuns well16.1 tok/s21848 ms62K
ReasoningCRuns well16.1 tok/s14201 ms35K

Quantization options

How Qwen3.5 35B A3B (35B params) fits at each quantization level on MacBook Pro M4 Max 128GB (92.2 GB usable).

QuantBitsVRAMQualityFit
Q2_K
2
13.7 GB
LowD33
Q3_K_S
3
17.2 GB
LowD34
NVFP4
4
19.6 GB
MediumD34
Q4_K_M
4
21.3 GB
MediumD35
Q5_K_M
5
25.2 GB
HighD35
Q6_K
6
28.7 GB
HighD36
Q8_0
8
37.5 GB
Very HighD38
F16Best for your GPU
16
71.8 GB
MaximumC45

Get started

HuggingFace
huggingface-cli download hf-unsloth--qwen3-5-35b-a3b-gguf

Upgrade options

Hardware that runs Qwen3.5 35B A3B well

AMDAMD Instinct MI350X 288GBBudget pick
C273.5 tok/s decode

~$8,000 MSRP

AMDAMD Instinct MI250X 128GBBest value
C116.9 tok/s decode

~$15,000 MSRP

NVIDIANVIDIA GH200 96GBBiggest leap
C151.8 tok/s decode

 

See all results for MacBook Pro M4 Max 128GBSee all hardware for Qwen3.5 35B A3B