Will It Run AI

All estimates are approximations based on mathematical models and public specifications. Actual performance may vary. Do not make purchasing decisions based solely on these estimates.

Data sourced from Hugging Face, Ollama, and official model documentation. Model names and logos are trademarks of their respective owners.

© 2026 Will It Run AI — Fase Consulting Ibiza, S.L. (NIF: B57969656)


01.AI

Yi 34B Chat

Legacy
HuggingFace
Downloads: 30.2K
Likes: 357
Released: Nov 2023
Context: 200K tokens
License: Yi Series
Quality: 4 Entry

Get started

Copy and paste to run locally
Ollama
ollama run yi:34b-chat
HuggingFace
huggingface-cli download 01-ai/Yi-34B-Chat

Quick specs

Parameters: 34B
Architecture: dense
Context: 200K tokens
Modality: text
Min RAM: 13.3 GB
Rec. RAM: 20.7 GB (Q4_K_M)
License: Yi Series
Family: Yi
✓ Chat

About this model


  • 🤖 The Yi series models are the next generation of open-source large language models trained from scratch by 01.AI.
  • 🙌 Targeted as a bilingual language model and trained on a 3T multilingual corpus, the Yi series models have become some of the strongest LLMs worldwide,...
  • The Yi-34B-Chat model landed in second place (following GPT-4 Turbo), outperforming other LLMs (such as GPT-4, Mixtral, Claude) on the AlpacaEval...
  • The Yi-34B model ranked first among all existing open-source models (such as Falcon-180B, Llama-70B, Claude) in both English and Chinese on...
  • 🙏 (Credits to Llama) Thanks to the Transformer and Llama open-source communities, as they reduce the effort required to build from scratch and...


Quick picks

Apple, best budget (C): Mac mini M4 64GB, ~$1,099, 4 tok/s
NVIDIA, best overall (B): NVIDIA A100 40GB, ~$10,000, 63 tok/s

Best hardware

Top picks for Yi 34B Chat

  • NVIDIA A100 40GB, 40 GB (B)
  • NVIDIA RTX PRO 5000 Blackwell 48GB, 48 GB (C)
  • NVIDIA RTX 6000 Ada 48GB, 48 GB (C)
  • NVIDIA L40 48GB, 48 GB (C)
  • NVIDIA L40S 48GB, 48 GB (C)

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant     Bits   VRAM      Quality     Fit
Q2_K      2      13.3 GB   Low         -
Q3_K_S    3      16.7 GB   Low         -
NVFP4     4      19.0 GB   Medium      -
Q4_K_M    4      20.7 GB   Medium      -
Q5_K_M    5      24.5 GB   High        -
Q6_K      6      27.9 GB   High        -
Q8_0      8      36.4 GB   Very High   -
F16       16     69.7 GB   Maximum     -
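
The estimates in the table above can be related back to the parameter count with a little arithmetic. The sketch below is an illustration, not the site's actual model: it assumes the nominal 34B parameter count from Quick specs, treats 1 GB as 1e9 bytes, and compares a memory budget against the listed figure only (no KV cache or runtime overhead). The names `QUANT_VRAM_GB`, `effective_bits_per_weight`, and `largest_fitting_quant` are hypothetical helpers introduced here.

```python
# Figures copied from the "VRAM estimates by quant level" table, in GB.
QUANT_VRAM_GB = {
    "Q2_K": 13.3,
    "Q3_K_S": 16.7,
    "NVFP4": 19.0,
    "Q4_K_M": 20.7,
    "Q5_K_M": 24.5,
    "Q6_K": 27.9,
    "Q8_0": 36.4,
    "F16": 69.7,
}

# Assumption: the nominal "34B" from Quick specs, not an exact parameter count.
PARAMS_BILLIONS = 34.0


def effective_bits_per_weight(vram_gb, params_billions=PARAMS_BILLIONS):
    """Bits per parameter implied by a VRAM figure (assumes 1 GB = 1e9 bytes)."""
    return vram_gb * 8.0 / params_billions


def largest_fitting_quant(budget_gb, table=QUANT_VRAM_GB):
    """Return the quant with the largest listed estimate <= budget_gb, or None."""
    fitting = {name: gb for name, gb in table.items() if gb <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for name, gb in QUANT_VRAM_GB.items():
        bpw = effective_bits_per_weight(gb)
        print(f"{name:8s} {gb:5.1f} GB  ~{bpw:.2f} bits/weight")
    print("Largest quant under 40 GB:", largest_fitting_quant(40.0))
```

The implied bits per weight come out above the nominal "Bits" column for most rows, which is expected if the estimates include per-block scaling data or a parameter count slightly above 34B; the page does not say which. Because the budget check ignores everything except the listed figure, a quant that "fits" here may still not leave room for context.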


Memory breakdown

Reference: NVIDIA A10 24GB

Weights: 20.7 GB
KV Cache: 5.3 GB
Runtime: 0.9 GB
Headroom: 2.4 GB
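
To make the breakdown easier to reuse, the four components can be summed in a small helper. This is a sketch only: the `MemoryBreakdown` class and its method names are hypothetical, and the page does not state whether headroom is spare capacity on the reference card or an extra reservation, so both totals are exposed rather than one being presented as the requirement.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MemoryBreakdown:
    """Memory components in GB, as listed in the breakdown above."""
    weights_gb: float
    kv_cache_gb: float
    runtime_gb: float
    headroom_gb: float

    def in_use_gb(self) -> float:
        """Weights + KV cache + runtime, without headroom."""
        return self.weights_gb + self.kv_cache_gb + self.runtime_gb

    def with_headroom_gb(self) -> float:
        """All four components added together."""
        return self.in_use_gb() + self.headroom_gb

    def fits(self, capacity_gb: float, include_headroom: bool = True) -> bool:
        """True if the chosen total is within capacity_gb."""
        needed = self.with_headroom_gb() if include_headroom else self.in_use_gb()
        return needed <= capacity_gb


# Values copied from the memory breakdown for Yi 34B Chat.
YI_34B_CHAT = MemoryBreakdown(
    weights_gb=20.7, kv_cache_gb=5.3, runtime_gb=0.9, headroom_gb=2.4
)

if __name__ == "__main__":
    print(f"In use:        {YI_34B_CHAT.in_use_gb():.1f} GB")
    print(f"With headroom: {YI_34B_CHAT.with_headroom_gb():.1f} GB")
```

With the listed figures, weights, KV cache, and runtime sum to 26.9 GB, or 29.3 GB once headroom is added. Calling `fits(capacity_gb)` with a card's memory size gives a quick check under whichever interpretation of headroom applies.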