Alibaba

Qwen3-Coder 30B A3B Instruct

Qwen Coder family, 30.5B parameters, recommended as Q4_K_M for first-pass local usage in V1.

Params

30.5B

Context

256K

License

Apache 2.0

Best runtime

Ollama

Recommended hardware

First-pass fit across priority GPUs

Open calculator
HardwareFitDecodeSafe ctx
NVIDIA A10 24GBTight fit39.7 tok/s17K
NVIDIA A100 40GBRuns well142.9 tok/s27K
NVIDIA A100 80GBRuns well187.4 tok/s46K
NVIDIA A16 64GBRuns well55.2 tok/s39K