Intel

Intel Arc A370M 4GB

Name: Intel Arc A370M 4GB
Brand: Intel

Arc A MobileLaptopAlchemistMOBILEoneAPI

4GB

VRAM

112GB/s

Bandwidth

8TFLOPS

FP16 Compute

64TOPS

INT8 Inference

Intel Arc A370M 4GBCategory AvgIntel Arc A380 6GB

Specifications

Compute

FP168 TFLOPS

INT864 TOPS

ArchitectureAlchemist

Memory

VRAM4 GB

Bandwidth112 GB/s

General

FamilyArc A Mobile

SegmentLaptop

InterconnectMOBILE

Compute PlatformONEAPI

Architecture

Alchemist

Alchemist is Intel's first discrete GPU architecture under the Arc brand, using Xe-HPG cores manufactured on TSMC's N6 process. It features XMX (Xe Matrix Extensions) engines for AI acceleration.

AI Relevance

XMX engines provide some AI inference acceleration via oneAPI/SYCL. However, the software ecosystem for LLM inference on Intel Arc is still developing, with limited runtime support compared to CUDA.

Process: TSMC N6Platform: ONEAPIPrecisions: FP32, FP16, INT8

Recommendations by Workload

Agentic Coding

Qwen 2.5 Coder 1.5B

This model is still usable for agentic-coding, but it is not the most specialized pick. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.

Decode 54.9 tok/s · 33K ctx · llama.cpp

3.0 GB / 4.0 GB VRAM

Chat

Qwen 3 1.7B

This model is a direct match for chat. It belongs to a current frontier family for local AI. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.

Decode 52.9 tok/s · 10K ctx · llama.cpp

3.1 GB / 4.0 GB VRAM

Coding

Qwen 2.5 Coder 1.5B

This model is still usable for coding, but it is not the most specialized pick. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.

Decode 54.9 tok/s · 21K ctx · llama.cpp

3.0 GB / 4.0 GB VRAM

RAG

Qwen 3 1.7B

This model is still usable for rag, but it is not the most specialized pick. It belongs to a current frontier family for local AI. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.

Decode 52.9 tok/s · 33K ctx · llama.cpp

3.1 GB / 4.0 GB VRAM

Reasoning

DeepSeek R1 1.5B

This model is a direct match for reasoning. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.

Decode 54.9 tok/s · 21K ctx · llama.cpp

3.0 GB / 4.0 GB VRAM

Intel Arc A370M 4GB

Specifications

Alchemist

Recommendations by Workload

Full Model Compatibility

Models you could run with an upgrade

Upgrade from Intel Arc A370M 4GB

Upgrade options

Intel Arc A370M 4GB

Specifications

Alchemist

Recommendations by Workload

Full Model Compatibility

Models you could run with an upgrade

Upgrade from Intel Arc A370M 4GB

Upgrade options