
All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)


Mistral

Ministral 8B

Downloads: 198.1K · Likes: 575 · Released: Oct 2024 · Context: 131K tokens · License: Mistral Research · Quality: 3 (Entry)

Get started — copy & paste to run locally

Ollama:
ollama run ministral-8b

HuggingFace:
huggingface-cli download mistralai/Ministral-8B-Instruct-2410

Quick specs

Parameters: 8B
Architecture: dense
Context: 131K tokens
Modality: text
Min RAM: 3.1 GB
Rec. RAM: 4.9 GB (Q4_K_M)
License: Mistral Research
Family: Mistral
Capabilities: ✓ Chat

About this model

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

  • Released under the Mistral Research License; reach out to us for a commercial license
  • Trained with a 128k context window with interleaved sliding-window attention
  • Trained on a large proportion of multilingual and code data
  • Supports function calling
  • Vocabulary size of 131k, using the V3-Tekken tokenizer
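Sliding-window attention restricts each token to a fixed window of recent tokens instead of the full causal prefix, which is what keeps long-context memory bounded. The mask pattern can be sketched as follows (the window size here is illustrative, not the model's actual configuration):

```python
import numpy as np

def sliding_window_mask(seq_len: int, window: int) -> np.ndarray:
    """Boolean attention mask: token i may attend to token j only if
    j is causal (j <= i) and within the last `window` positions."""
    i = np.arange(seq_len)[:, None]
    j = np.arange(seq_len)[None, :]
    return (j <= i) & (j > i - window)

# With window=3, token 5 sees tokens 3, 4, 5 but not 0-2.
m = sliding_window_mask(6, 3)
```

In an interleaved scheme, some layers use a mask like this while others (or the same layers via rotation) cover different ranges, so information still propagates across the full context over depth.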



Quick picks

  • Best budget: Intel Arc B580 12GB (Intel) — ~$249, ~45 tok/s, fit grade C
  • Best overall: RTX 3080 10GB (NVIDIA) — ~$699, ~118 tok/s, fit grade B

Best hardware

Top picks for Ministral 8B (all fit grade B):

  • NVIDIA RTX 3080 10GB — 10 GB
  • NVIDIA RTX 2080 Ti 11GB — 11 GB
  • NVIDIA RTX 3080 12GB — 12 GB
  • NVIDIA RTX 3080 Ti 12GB — 12 GB
  • NVIDIA RTX 5070 12GB — 12 GB
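Tok/s figures like the ones above are typically memory-bandwidth-bound during decoding: each generated token requires reading the full weight set once. A rough estimator, where the bandwidth figure and efficiency factor are assumptions for illustration, not measured values:

```python
def decode_tok_per_s(mem_bandwidth_gbps: float,
                     weights_gb: float,
                     efficiency: float = 0.75) -> float:
    """Upper-bound decode throughput: bandwidth divided by bytes read
    per token (the whole weight file), scaled by an assumed efficiency."""
    return efficiency * mem_bandwidth_gbps / weights_gb

# Hypothetical: RTX 3080 (~760 GB/s) running the Q4_K_M weights (4.9 GB)
# lands in the same ballpark as the ~118 tok/s quoted above.
est = decode_tok_per_s(760, 4.9)
```

Real throughput also depends on compute for prompt processing, batch size, and runtime overhead, so treat this as a ceiling rather than a prediction.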

Quantization options

VRAM estimates by quant level:

Quant    Bits  VRAM     Quality
Q2_K      2    3.1 GB   Low
Q3_K_S    3    3.9 GB   Low
NVFP4     4    4.5 GB   Medium
Q4_K_M    4    4.9 GB   Medium
Q5_K_M    5    5.8 GB   High
Q6_K      6    6.6 GB   High
Q8_0      8    8.6 GB   Very High
F16      16    16.4 GB  Maximum


Memory breakdown

Reference: NVIDIA A10 24GB

Weights: 4.9 GB
KV Cache: 1.3 GB
Runtime: 0.9 GB
Headroom: 2.4 GB
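The KV-cache line grows with context length: each layer stores a key and a value vector per token per KV head. A sketch of that arithmetic — the layer count, KV-head count, and head dimension below are assumed values for an 8B dense model with grouped-query attention, not confirmed specs for Ministral 8B:

```python
def kv_cache_gb(ctx_tokens: int,
                n_layers: int = 36,      # assumption
                n_kv_heads: int = 8,     # assumption (GQA)
                head_dim: int = 128,     # assumption
                bytes_per_elem: int = 2  # fp16 cache
                ) -> float:
    """KV cache size: 2 tensors (K and V) per layer, each
    n_kv_heads * head_dim elements per token."""
    return 2 * n_layers * n_kv_heads * head_dim * ctx_tokens * bytes_per_elem / 1e9

# At an 8K-token context this yields roughly the 1.3 GB shown above;
# pushing toward the full 131K context multiplies it accordingly.
total_gb = 4.9 + kv_cache_gb(8192) + 0.9  # weights + KV + runtime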