Microsoft

Phi-4-reasoning-plus 14B

Name: Phi-4-reasoning-plus 14B
Author: Microsoft

Frontier

HuggingFace

Ollama

4.7KDownloads335LikesApr 2025Released33K tokensContextMITLicense4 EntryQuality

Get started

— copy & paste to run locally

Ollama

ollama run phi-4-reasoning-plus-14b

HuggingFace

huggingface-cli download phi-4-reasoning-plus-14b

Quick specs

Parameters14.7B

Architecturedense

Context33K tokens

Modalitytext

Min RAM5.7 GB

Rec. RAM9 GB (Q4_K_M)

LicenseMIT

FamilyPhi

✓ Chat✓ Reasoning

About this model

> [!IMPORTANT] > To fully take advantage of the model's capabilities, inference must use `temperature=0.8`, `top_k=50`, `top_p=0.95`, and `do_sample=True`. For more complex queries, set `max_new_tokens=32768` to allow for longer chain-of-thought (CoT).

•AIME 2025, 2024, 2023, and 2022:: Math olympiad questions
•GPQA-Diamond:: Complex, graduate-level science questions
•OmniMath:: Collection of over 4000 olympiad-level math problems with human annotation
•LiveCodeBench:: Code generation benchmark gathered from competitive coding contests
•3SAT (3-literal Satisfiability Problem) and TSP (Traveling Salesman Problem):: Algorithmic problem solving

Related models

Quick picks

Best budgetC

RX 7600 XT 16GB~$329 — 19 tok/s

Best overallB

RX 7900 XT 20GB~$899 — 54 tok/s

Best hardware

Top picks for Phi-4-reasoning-plus 14B

RTX 5090 Laptop 24GBC

24 GB

NVIDIA A30 24GBC

24 GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	5.7 GB	Low	—
Q3_K_S	3	7.2 GB	Low	—
NVFP4	4	8.2 GB	Medium	—
Q4_K_M	4	9.0 GB	Medium	—
Q5_K_M	5	10.6 GB	High	—
Q6_K	6	12.1 GB	High	—
Q8_0	8	15.7 GB	Very High	—
F16	16	30.1 GB	Maximum	—

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: NVIDIA A10 24GB

Weights9.0 GB

KV Cache2.3 GB

Runtime0.9 GB

Headroom2.4 GB

Microsoft

Phi-4-reasoning-plus 14B

Frontier

HuggingFace

Ollama

4.7KDownloads335LikesApr 2025Released33K tokensContextMITLicense4 EntryQuality

Get started

— copy & paste to run locally

Ollama

ollama run phi-4-reasoning-plus-14b

HuggingFace

huggingface-cli download phi-4-reasoning-plus-14b

Quick specs

Parameters14.7B

Architecturedense

Context33K tokens

Modalitytext

Min RAM5.7 GB

Rec. RAM9 GB (Q4_K_M)

LicenseMIT

FamilyPhi

✓ Chat✓ Reasoning

About this model

•AIME 2025, 2024, 2023, and 2022:: Math olympiad questions
•GPQA-Diamond:: Complex, graduate-level science questions
•OmniMath:: Collection of over 4000 olympiad-level math problems with human annotation
•LiveCodeBench:: Code generation benchmark gathered from competitive coding contests
•3SAT (3-literal Satisfiability Problem) and TSP (Traveling Salesman Problem):: Algorithmic problem solving

Related models

Quick picks

Best budgetC

RX 7600 XT 16GB~$329 — 19 tok/s

Best overallB

RX 7900 XT 20GB~$899 — 54 tok/s

Best hardware

Top picks for Phi-4-reasoning-plus 14B

RTX 5090 Laptop 24GBC

24 GB

NVIDIA A30 24GBC

24 GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	5.7 GB	Low	—
Q3_K_S	3	7.2 GB	Low	—
NVFP4	4	8.2 GB	Medium	—
Q4_K_M	4	9.0 GB	Medium	—
Q5_K_M	5	10.6 GB	High	—
Q6_K	6	12.1 GB	High	—
Q8_0	8	15.7 GB	Very High	—
F16	16	30.1 GB	Maximum	—

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: NVIDIA A10 24GB

Weights9.0 GB

KV Cache2.3 GB

Runtime0.9 GB

Headroom2.4 GB