How much VRAM does Codestral 22B v0.1 need?

Codestral 22B v0.1 (22B parameters) requires approximately 21.3 GB of memory with Q4_K_M quantization.

What is the best quantization for Codestral 22B v0.1?

The recommended quantization for Codestral 22B v0.1 is Q4_K_M, which balances quality and memory efficiency.

Can it run?

Can AMD Instinct MI100 32GB run Codestral 22B v0.1?

Q: Can AMD Instinct MI100 32GB run Codestral 22B v0.1?

Yes, AMD Instinct MI100 32GB can run Codestral 22B v0.1 with a C grade (Runs well). Expected decode speed: 59.5 tok/s.

CUsable

Runs well

Using Q4_K_M in Ollama

Capabilities:

Fit status

Runs well

Decode

59.5 tok/s

TTFT

3255 ms

Safe context

24K

Memory

21.3 GB / 32.0 GB

Memory breakdown

Weights13.4 GB

KV Cache3.4 GB

Runtime1.2 GB

Headroom3.2 GB

Performance by workload

Workload	Grade	Fit	Decode	TTFT	Context
Agentic Coding	B	Runs well	59.5 tok/s	4734 ms	41K
Chat	C	Runs well	59.5 tok/s	1775 ms	13K
Coding	C	Runs well	59.5 tok/s	3255 ms	24K
RAG	B	Runs well	59.5 tok/s	5918 ms	41K
Reasoning	C	Runs well	59.5 tok/s	3847 ms	24K

Quantization options

How Codestral 22B v0.1 (22B params) fits at each quantization level on AMD Instinct MI100 32GB (32.0 GB usable).

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	8.6 GB	Low	D35
Q3_K_S	3	10.8 GB	Low	D36
NVFP4	4	12.3 GB	Medium	D37
Q4_K_M	4	13.4 GB	Medium	D38
Q5_K_M	5	15.8 GB	High	D40
Q6_K	6	18.0 GB	High	C41
Q8_0Best for your GPU	8	23.5 GB	Very High	C44
F16	16	45.1 GB	Maximum	F0

Get started

See all results for AMD Instinct MI100 32GB See all hardware for Codestral 22B v0.1

Can it run?

Can AMD Instinct MI100 32GB run Codestral 22B v0.1?

CUsable

Runs well

Using Q4_K_M in Ollama

Capabilities:

Fit status

Runs well

Decode

59.5 tok/s

TTFT

3255 ms

Safe context

24K

Memory

21.3 GB / 32.0 GB

Memory breakdown

Weights13.4 GB

KV Cache3.4 GB

Runtime1.2 GB

Headroom3.2 GB

Performance by workload

Workload	Grade	Fit	Decode	TTFT	Context
Agentic Coding	B	Runs well	59.5 tok/s	4734 ms	41K
Chat	C	Runs well	59.5 tok/s	1775 ms	13K
Coding	C	Runs well	59.5 tok/s	3255 ms	24K
RAG	B	Runs well	59.5 tok/s	5918 ms	41K
Reasoning	C	Runs well	59.5 tok/s	3847 ms	24K

Quantization options

How Codestral 22B v0.1 (22B params) fits at each quantization level on AMD Instinct MI100 32GB (32.0 GB usable).

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	8.6 GB	Low	D35
Q3_K_S	3	10.8 GB	Low	D36
NVFP4	4	12.3 GB	Medium	D37
Q4_K_M	4	13.4 GB	Medium	D38
Q5_K_M	5	15.8 GB	High	D40
Q6_K	6	18.0 GB	High	C41
Q8_0Best for your GPU	8	23.5 GB	Very High	C44
F16	16	45.1 GB	Maximum	F0

Get started

See all results for AMD Instinct MI100 32GB See all hardware for Codestral 22B v0.1