01.AI

Yi 34B Chat

Name: Yi 34B Chat
Author: 01.AI

Legacy

HuggingFace

Downloads: 30.2K
Likes: 357
Released: Nov 2023
Context: 200K tokens
License: Yi Series
Quality: 4 Entry

Get started

Copy and paste to run locally:

Ollama

ollama run yi:34b-chat

HuggingFace

huggingface-cli download 01-ai/Yi-34B-Chat
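
Once a local Ollama server is running and has the model pulled, the model can also be queried from a script over Ollama's HTTP API. The following is a minimal sketch, not an official client: it assumes the default local endpoint on port 11434 and uses only the Python standard library. The helper names `build_request` and `generate` are hypothetical, and the model tag is left as a parameter because it depends on how the model is named in your local install.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

To use it, call `generate("Hello", model=...)` with whatever tag `ollama list` reports for the model on your machine.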

Quick specs

Parameters: 34B
Architecture: dense
Context: 200K tokens
Modality: text
Min RAM: 13.3 GB
Rec. RAM: 20.7 GB (Q4_K_M)
License: Yi Series
Family: Yi

✓ Chat

About this model

• 🤖 The Yi series models are the next generation of open-source large language models trained from scratch by 01.AI.
• 🙌 Targeted as bilingual language models and trained on a 3T-token multilingual corpus, the Yi series models are among the strongest LLMs worldwide,...
• The Yi-34B-Chat model landed in second place (following GPT-4 Turbo), outperforming other LLMs (such as GPT-4, Mixtral, Claude) on the AlpacaEval...
• The Yi-34B model ranked first among all existing open-source models (such as Falcon-180B, Llama-70B, Claude) in both English and Chinese on...
• 🙏 (Credits to Llama) Thanks to the Transformer and Llama open-source communities, which reduce the effort required to build from scratch and...

Quick picks

Best budget (grade C): Mac mini M4 64GB, ~$1,099, 4 tok/s

Best overall (grade B): NVIDIA A100 40GB, ~$10,000, 63 tok/s

Best hardware

Top picks for Yi 34B Chat

NVIDIA A100 40GB (grade B), 40 GB

RTX PRO 5000 Blackwell 48GB (grade C)

Quantization options

VRAM estimates by quant level

Quant	Bits	VRAM	Quality
Q2_K	2	13.3 GB	Low
Q3_K_S	3	16.7 GB	Low
NVFP4	4	19.0 GB	Medium
Q4_K_M	4	20.7 GB	Medium
Q5_K_M	5	24.5 GB	High
Q6_K	6	27.9 GB	High
Q8_0	8	36.4 GB	Very High
F16	16	69.7 GB	Maximum
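
The table can be used to pick a quantization for a given memory budget. The sketch below copies the VRAM figures from the table and returns the largest quant that fits. The function name `best_fit` and the `reserve_gb` parameter are hypothetical. The Q4_K_M figure matches the Weights line in the memory breakdown, so the table values appear to cover weights only, and `reserve_gb` is an assumed allowance for KV cache and runtime overhead that you would set yourself.

```python
# (quant, bits, VRAM in GB, quality) copied from the table above.
QUANTS = [
    ("Q2_K", 2, 13.3, "Low"),
    ("Q3_K_S", 3, 16.7, "Low"),
    ("NVFP4", 4, 19.0, "Medium"),
    ("Q4_K_M", 4, 20.7, "Medium"),
    ("Q5_K_M", 5, 24.5, "High"),
    ("Q6_K", 6, 27.9, "High"),
    ("Q8_0", 8, 36.4, "Very High"),
    ("F16", 16, 69.7, "Maximum"),
]


def best_fit(vram_gb: float, reserve_gb: float = 0.0):
    """Return the name of the largest quant whose estimate fits, or None.

    reserve_gb is a hypothetical allowance for KV cache and runtime;
    the table itself does not say what its estimates include.
    """
    budget = vram_gb - reserve_gb
    fitting = [q for q in QUANTS if q[2] <= budget]
    if not fitting:
        return None
    # Larger estimates correspond to higher quality levels in the table.
    return max(fitting, key=lambda q: q[2])[0]
```

For example, `best_fit(24)` selects Q4_K_M, since Q5_K_M at 24.5 GB is just over a 24 GB budget.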

Memory breakdown

Reference: NVIDIA A10 24GB

Weights: 20.7 GB
KV Cache: 5.3 GB
Runtime: 0.9 GB
Headroom: 2.4 GB
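
As a sanity check on the breakdown, the listed components can be added up and compared with the reference card's capacity. This is a sketch under the assumption that weights, KV cache, and runtime are simply additive. The page does not say how its Headroom figure is derived, so the code does not try to reproduce it. The names `total_required` and `margin` are hypothetical.

```python
# Figures copied from the memory breakdown above (GB).
CAPACITY_GB = 24.0  # NVIDIA A10 24GB reference card
COMPONENTS = {"weights": 20.7, "kv_cache": 5.3, "runtime": 0.9}


def total_required(components: dict) -> float:
    """Sum the listed memory components, rounded to 0.1 GB."""
    return round(sum(components.values()), 1)


def margin(capacity_gb: float, components: dict) -> float:
    """Capacity minus required memory; negative means it does not fit."""
    return round(capacity_gb - total_required(components), 1)
```

With the figures listed, the components add up to 26.9 GB, which is 2.9 GB more than the 24 GB card provides under this additive assumption. The Headroom figure in the breakdown is therefore presumably computed on a different basis.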
