IBM

Granite 3.1 8B

Name: Granite 3.1 8B
Author: IBM

Current

HuggingFace

Ollama

105.0KDownloads168LikesDec 2024Released128K tokensContextApache 2.0License3 EntryQuality

Get started

— copy & paste to run locally

Ollama

ollama run granite-3.1-8b

HuggingFace

huggingface-cli download granite-3.1-8b

Quick specs

Parameters8B

Architecturedense

Context128K tokens

Modalitytext

Min RAM3.1 GB

Rec. RAM4.9 GB (Q4_K_M)

LicenseApache 2.0

FamilyGranite

✓ Code✓ Chat

About this model

Model Summary: Granite-3.1-8B-Instruct is a 8B parameter long-context instruct model finetuned from Granite-3.1-8B-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets tailored for solving long context problems. This model is developed using a diverse set of techniques with a structured chat format, including supervised finetuning, model alignment using reinforcement learning, and model merging.

•Developers:: Granite Team, IBM
•GitHub Repository:: ibm-granite/granite-3.1-language-models
•Website: Granite Docs
•Paper:: Granite 3.1 Language Models (coming soon)
•Release Date: December 18th, 2024

Related models

Quick picks

Best budgetC

Intel Arc B580 12GB~$249 — 45 tok/s

Best overallB

RTX 3080 10GB~$699 — 118 tok/s

Best hardware

Top picks for Granite 3.1 8B

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	3.1 GB	Low	—
Q3_K_S	3	3.9 GB	Low	—
NVFP4	4	4.5 GB	Medium	—
Q4_K_M	4	4.9 GB	Medium	—
Q5_K_M	5	5.8 GB	High	—
Q6_K	6	6.6 GB	High	—
Q8_0	8	8.6 GB	Very High	—
F16	16	16.4 GB	Maximum	—

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: NVIDIA A10 24GB

Weights4.9 GB

KV Cache1.3 GB

Runtime0.9 GB

Headroom2.4 GB

IBM

Granite 3.1 8B

Current

HuggingFace

Ollama

105.0KDownloads168LikesDec 2024Released128K tokensContextApache 2.0License3 EntryQuality

Get started

— copy & paste to run locally

Ollama

ollama run granite-3.1-8b

HuggingFace

huggingface-cli download granite-3.1-8b

Quick specs

Parameters8B

Architecturedense

Context128K tokens

Modalitytext

Min RAM3.1 GB

Rec. RAM4.9 GB (Q4_K_M)

LicenseApache 2.0

FamilyGranite

✓ Code✓ Chat

About this model

•Developers:: Granite Team, IBM
•GitHub Repository:: ibm-granite/granite-3.1-language-models
•Website: Granite Docs
•Paper:: Granite 3.1 Language Models (coming soon)
•Release Date: December 18th, 2024

Related models

Quick picks

Best budgetC

Intel Arc B580 12GB~$249 — 45 tok/s

Best overallB

RTX 3080 10GB~$699 — 118 tok/s

Best hardware

Top picks for Granite 3.1 8B

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	3.1 GB	Low	—
Q3_K_S	3	3.9 GB	Low	—
NVFP4	4	4.5 GB	Medium	—
Q4_K_M	4	4.9 GB	Medium	—
Q5_K_M	5	5.8 GB	High	—
Q6_K	6	6.6 GB	High	—
Q8_0	8	8.6 GB	Very High	—
F16	16	16.4 GB	Maximum	—

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: NVIDIA A10 24GB

Weights4.9 GB

KV Cache1.3 GB

Runtime0.9 GB

Headroom2.4 GB