Question 1

Can GTX 1650 4GB run mxbai Embed Large?

Accepted Answer

Yes. The GTX 1650 4GB can run mxbai Embed Large, reaching a B grade (Runs well) on its best workload, Coding, with an expected decode speed of 92.2 tok/s. The other workloads grade C at 64.0 tok/s.

Question 2

How much VRAM does mxbai Embed Large need?

Accepted Answer

mxbai Embed Large (about 0.335B parameters) requires approximately 2.6 GB of memory with F16 quantization.
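
As a rough cross-check, the weight-only footprint can be estimated as parameters × bits per weight ÷ 8. The sketch below is only an approximation under stated assumptions: the helper name `weight_memory_gb` is hypothetical, 1 GB is taken as 10^9 bytes, and the bit widths are the nominal values listed for each quantization on this page rather than exact bits per weight. The 2.6 GB figure quoted above is larger than the weight-only estimate for F16; the difference is presumably runtime overhead, which this page does not break down.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Estimate weight-only memory in GB (assuming 1 GB = 1e9 bytes).

    Covers the model weights only; activations, buffers, and other
    runtime overhead are not included.
    """
    total_bits = params_billion * 1e9 * bits_per_weight
    return total_bits / 8 / 1e9


# Parameter count stated in the answer above, in billions.
PARAMS_B = 0.335

# Nominal bit widths as listed on this page (an assumption: real
# quantization formats often use slightly more bits per weight).
for name, bits in [("Q2_K", 2), ("Q3_K_S", 3), ("F16", 16)]:
    print(f"{name}: ~{weight_memory_gb(PARAMS_B, bits):.2f} GB for weights")
```

Comparing the result against the card's 4 GB of VRAM gives a lower bound on what must fit; the page's own VRAM column remains the better guide to actual usage.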

Question 3

What is the best quantization for mxbai Embed Large?

Accepted Answer

The recommended quantization for mxbai Embed Large is F16, which balances quality and memory efficiency.

Workload	Grade	Fit	Decode	TTFT	Context
Agentic Coding	C	Runs well	64.0 tok/s	4400 ms	512
Chat	C	Runs well	64.0 tok/s	1650 ms	512
Coding	B	Runs well	92.2 tok/s	2101 ms	512
RAG	C	Runs well	64.0 tok/s	5500 ms	512
Reasoning	C	Runs well	64.0 tok/s	3575 ms	512

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	0.1 GB	Low	D30
Q3_K_S	3	0.2 GB	Low	D30
NVFP4	4

Memory breakdown