Agentic Coding
CGranite 3.1 8B
This model is still usable for agentic-coding, but it is not the most specialized pick. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama.
Apple
MacBook Pro M3 Pro 18GB with 18 GB unified memory. Third-generation Apple Silicon built on 3nm process with dynamic caching GPU architecture, significantly improving AI inference efficiency.
Architecture
Apple M3 is built on TSMC's 3nm process, the first consumer chips at this node. It introduces Dynamic Caching for more efficient GPU memory allocation and hardware-accelerated ray tracing.
AI Relevance
Dynamic Caching improves GPU utilization for compute workloads including ML inference. The M3 Ultra with up to 512 GB unified memory can theoretically hold even unquantized 70B models, though memory bandwidth remains the throughput bottleneck.
M3's dynamic caching GPU architecture allocates local memory in hardware in real-time, improving GPU utilization for AI workloads. The M3 Max reaches 400 GB/s bandwidth, competitive with mid-range discrete GPUs.
Agentic Coding
CThis model is still usable for agentic-coding, but it is not the most specialized pick. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama.
Chat
CThis model is a direct match for chat. It belongs to a current frontier family for local AI. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.
Coding
CThis model is still usable for coding, but it is not the most specialized pick. It sits in the middle of the current model mix. It fits natively with comfortable headroom. Known channels: huggingface, ollama.
RAG
CThis model is a direct match for rag. It sits in the middle of the current model mix. It fits natively with comfortable headroom.
Reasoning
CThis model is a direct match for reasoning. It belongs to a current frontier family for local AI. It fits natively with comfortable headroom. Known channels: huggingface, ollama, lm-studio.
Just out of reach
High-quality models that need a bit more memory
Upgrade paths
See what you unlock with more powerful hardware
Upgrade options
~$329 MSRP
~$1,099 MSRP
~$8,000 MSRP