Browse AI Models

12 models available

/

Status:

Sort:

Filtered by:

Mistral Devstral 2 123B Instruct

123B256K ctx68.9 GBfrontier

denseLegacy

Devstral is an agentic LLM for software engineering tasks. Devstral 2 excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench.

Mistral Mistral Large 3

675B (41B active)256K ctx378 GBfrontier

moeLegacy

Mistral-Large-Instruct-2411 is an advanced dense Large Language Model (LLM) of 123B parameters with state-of-the-art reasoning, knowledge and coding capabilities extending Mistral-Large-Instruct-2407 with better Long Context, Function Calling and System Prompt.

Alibaba Qwen3-Coder 30B A3B Instruct

30.5B (3.3B active)256K ctx17.1 GBfrontier

moeLegacy

Qwen3-Coder is available in multiple sizes. Today, we're excited to introduce Qwen3-Coder-30B-A3B-Instruct. This streamlined model maintains impressive performance and efficiency, featuring the following key enhancements:

Alibaba Qwen3-Coder 480B A35B Instruct

480B (35B active)256K ctx268.8 GBfrontier

moeLegacy

Today, we're announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder is available in multiple sizes, but we're excited to introduce its most powerful variant first: Qwen3-Coder-480B-A35B-Instruct. featuring the following key enhancements:

Alibaba Qwen3-Coder-Next

80B (3B active)256K ctx44.8 GBfrontier

moeLegacy

Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development. It features the following key enhancements:

Alibaba Qwen3-VL 30B A3B Instruct

30B (3B active)256K ctx16.8 GBfrontier

moeLegacy

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.

Mistral Devstral Small 2 24B Instruct

24B256K ctx13.4 GBfrontier

denseLegacy

Devstral is an agentic LLM for software engineering tasks. Devstral Small 2 excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench.

Mistral Devstral Small 1.1

24B131K ctx13.4 GBcurrent

denseLegacy

Devstral is an agentic LLM for software engineering tasks built under a collaboration between Mistral AI and All Hands AI 🙌. Devstral excels at using tools to explore codebases, editing multiple files and power software engineering agents. The model achieves remarkable performance on SWE-bench which positions it as the #1 open source model on this benchmark.

OpenAI GPT-OSS 20B

21B (3.6B active)128K ctx11.8 GBfrontier

moeLegacy

GPT-OSS 20B is OpenAI's first open-weight model, a 21B-parameter mixture-of-experts model with 3.6B active parameters per token. Features configurable reasoning effort (low/medium/high), full chain-of-thought visibility, and agentic capabilities including function calling. Runs on devices with 16GB of memory using MXFP4 quantization.

Mistral Ministral 3 14B

14B262K ctx7.8 GBfrontier

multimodalLegacy

The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language model with vision capabilities.

Mistral Ministral 3 8B

8B262K ctx4.5 GBfrontier

multimodalLegacy

A balanced model in the Ministral 3 family, Ministral 3 8B is a powerful, efficient tiny language model with vision capabilities.

Mistral Ministral 3 3B

3B262K ctx1.7 GBfrontier

multimodalLegacy

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

Browse AI Models

12 models available

/

Status:

Sort:

Filtered by:

Mistral Devstral 2 123B Instruct

123B256K ctx68.9 GBfrontier

denseLegacy

Mistral Mistral Large 3

675B (41B active)256K ctx378 GBfrontier

moeLegacy

Alibaba Qwen3-Coder 30B A3B Instruct

30.5B (3.3B active)256K ctx17.1 GBfrontier

moeLegacy

Alibaba Qwen3-Coder 480B A35B Instruct

480B (35B active)256K ctx268.8 GBfrontier

moeLegacy

Alibaba Qwen3-Coder-Next

80B (3B active)256K ctx44.8 GBfrontier

moeLegacy

Today, we're announcing Qwen3-Coder-Next, an open-weight language model designed specifically for coding agents and local development. It features the following key enhancements:

Alibaba Qwen3-VL 30B A3B Instruct

30B (3B active)256K ctx16.8 GBfrontier

moeLegacy

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.

Mistral Devstral Small 2 24B Instruct

24B256K ctx13.4 GBfrontier

denseLegacy

Mistral Devstral Small 1.1

24B131K ctx13.4 GBcurrent

denseLegacy

OpenAI GPT-OSS 20B

21B (3.6B active)128K ctx11.8 GBfrontier