```python
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest
```
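These imports come from Mistral's mistral_common package. A minimal usage sketch follows (repeating the imports so it runs standalone); the v3 tokenizer choice and the prompt content are assumptions, so pick the tokenizer version that matches your model:

```python
from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest

# Assumed: the v3 tokenizer, used by recent Mistral instruct models.
tokenizer = MistralTokenizer.v3()

# Render and tokenize a chat request in the exact format the model expects.
request = ChatCompletionRequest(messages=[UserMessage(content="Hello, how are you?")])
tokenized = tokenizer.encode_chat_completion(request)

print(tokenized.text)         # the rendered prompt string
print(len(tokenized.tokens))  # token ids ready to feed to the model
```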
GPT-OSS 20B is one of OpenAI's first open-weight models since GPT-2: a 21B-parameter mixture-of-experts model with 3.6B active parameters per token. It features configurable reasoning effort (low/medium/high), full chain-of-thought visibility, and agentic capabilities including function calling, and it runs on devices with 16GB of memory using MXFP4 quantization.
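The reasoning effort is selected through the system prompt. A minimal sketch using the Transformers chat pipeline, assuming the openai/gpt-oss-20b checkpoint and the "Reasoning: \<level\>" system-prompt convention described in the model card:

```python
from transformers import pipeline

# Sketch: the model id and the "Reasoning: <level>" system-prompt convention
# follow the gpt-oss model card; treat both as assumptions.
pipe = pipeline(
    "text-generation",
    model="openai/gpt-oss-20b",
    torch_dtype="auto",
    device_map="auto",
)

messages = [
    {"role": "system", "content": "Reasoning: high"},  # low / medium / high
    {"role": "user", "content": "Why does MXFP4 quantization cut memory use?"},
]
outputs = pipe(messages, max_new_tokens=256)
print(outputs[0]["generated_text"][-1])  # the assistant's reply
```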
InternLM2.5 has open-sourced a 20 billion parameter base model and a chat model tailored for practical scenarios. The series is characterized by strong reasoning capability, especially in math, and improved tool use.
Mistral-Small-3.2-24B-Instruct-2506 is a minor update of Mistral-Small-3.1-24B-Instruct-2503.
Qwen2.5 is the latest series of Qwen large language models, released as base and instruction-tuned models ranging from 0.5 to 72 billion parameters. Compared with Qwen2, it offers more knowledge, stronger coding and mathematics capabilities, better instruction following, and improved long-context handling.
StarCoder 7B is BigCode's code generation model trained on The Stack v1. Supports over 80 programming languages with fill-in-the-middle capability and 8K context window.
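Fill-in-the-middle works by wrapping the gap in sentinel tokens so the model generates the missing span. A minimal sketch, assuming the bigcode/starcoderbase-7b checkpoint and the `<fim_prefix>`/`<fim_suffix>`/`<fim_middle>` sentinels used by the StarCoder family:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "bigcode/starcoderbase-7b"  # assumed 7B checkpoint name
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")

# PSM format: prefix, then suffix, then the model fills in the middle.
prompt = (
    "<fim_prefix>def mean(xs):\n    total = "
    "<fim_suffix>\n    return total / len(xs)<fim_middle>"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=16)

# Decode only the newly generated middle span.
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```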
Aya Expanse 8B is Cohere's multilingual model supporting 23 languages with strong cross-lingual transfer. Designed for global applications requiring high-quality generation across diverse languages.
Gemma 2 27B is Google's largest Gemma 2 model, offering state-of-the-art performance among open models of similar size. Built on Gemini technology with strong reasoning, code, and multilingual capabilities.
The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language model with vision capabilities.
Mistral Small 3 (2501) sets a new benchmark in the "small" Large Language Models category below 70B, boasting 24B parameters and achieving state-of-the-art capabilities comparable to larger models! This model is an instruction-fine-tuned version of the base model: Mistral-Small-24B-Base-2501.
- Model type: A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
- Language(s) (NLP): Primarily English
- License: MIT
- Finetuned from model: mistralai/Mistral-7B-v0.1