Moonshot AI

Kimi K2.5

A Kimi-family model with 1000B (1T) parameters. Q4_K_M quantization is recommended for first-pass local usage in V1.

Params: 1000B

Context: 256K

License: Custom

Best runtime: vLLM

Recommended hardware

First-pass fit across priority GPUs

Hardware           Fit         Decode      Safe ctx
NVIDIA A10 24GB    Too heavy   2 tok/s     4K
NVIDIA A100 40GB   Too heavy   3.5 tok/s   4K
NVIDIA A100 80GB   Too heavy   4.6 tok/s   4K
NVIDIA A16 64GB    Too heavy   2 tok/s     4K
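
A back-of-the-envelope estimate helps explain why every GPU above is rated "Too heavy". The sketch below is not the calculator behind this page. It assumes Q4_K_M averages roughly 4.8 bits per weight (an approximation, since the exact figure varies by model) and counts weights only, ignoring KV cache, activations, and runtime overhead. The function names and the GPU dictionary are hypothetical helpers introduced for illustration.

```python
import math

# Assumption: Q4_K_M averages about 4.8 bits per weight; the real value varies by model.
ASSUMED_BITS_PER_WEIGHT = 4.8

# VRAM sizes taken from the hardware names listed on this page.
GPU_VRAM_GB = {
    "NVIDIA A10 24GB": 24,
    "NVIDIA A100 40GB": 40,
    "NVIDIA A100 80GB": 80,
    "NVIDIA A16 64GB": 64,
}


def weight_memory_gb(params_billion: float,
                     bits_per_weight: float = ASSUMED_BITS_PER_WEIGHT) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    # billions of params * bits per weight / 8 bits per byte = billions of bytes
    return params_billion * bits_per_weight / 8


def min_gpus_for_weights(params_billion: float, vram_gb: float,
                         bits_per_weight: float = ASSUMED_BITS_PER_WEIGHT) -> int:
    """Lower bound on GPU count needed just to hold the weights."""
    return math.ceil(weight_memory_gb(params_billion, bits_per_weight) / vram_gb)


if __name__ == "__main__":
    needed = weight_memory_gb(1000)  # 1000B parameters, as listed above
    print(f"Estimated weights at Q4_K_M: ~{needed:.0f} GB")
    for name, vram in GPU_VRAM_GB.items():
        count = min_gpus_for_weights(1000, vram)
        print(f"{name}: at least {count} cards for weights alone")
```

Under these assumptions the weights alone come to roughly 600 GB, many times the memory of any single card listed, which is consistent with the "Too heavy" rating in every row.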