chat
Qwen 3 14B
Qwen 3 14B matches Chat and keeps a practical fit profile. It sits in the middle of the current generation mix. It fits natively with comfortable headroom. Context coverage stays within the requested workload envelope.
Decode 162.2 tok/s with ExLlamaV2.
coding
Gemma 3 27B
Gemma 3 27B is a specialized fit for Coding. It sits in the middle of the current generation mix. It fits natively with comfortable headroom. Context coverage stays within the requested workload envelope. Known distribution channels: huggingface, ollama, lm-studio.
Decode 84.1 tok/s with ExLlamaV2.
reasoning
Qwen 3 14B
Qwen 3 14B matches Reasoning and keeps a practical fit profile. It sits in the middle of the current generation mix. It fits natively with comfortable headroom. Context coverage stays within the requested workload envelope.
Decode 162.2 tok/s with ExLlamaV2.