Browse AI Models

283 models available

Cohere Command R 35B

35B | 131K ctx | 19.6 GB | current

dense | Legacy

Command R is Cohere's retrieval-augmented generation model optimized for enterprise use. Excels at long-context document processing, tool use, and grounded generation with citation support.

Meta Llama 3.2 11B Vision

11B | 16K ctx | 6.2 GB | legacy

vision | Legacy

Llama 3.2 11B Vision is Meta's multimodal model that processes both text and images. Supports visual question answering, image captioning, and document understanding alongside standard text generation.

Mistral Magistral Small 2507

24B | 131K ctx | 13.4 GB | legacy

dense | Legacy

Built on Mistral Small 3.1 (2503) with added reasoning capabilities. It underwent SFT on Magistral Medium traces followed by RL, yielding a small, efficient reasoning model with 24B parameters.

Mistral Mixtral 8x7B

47B (13B active) | 33K ctx | 26.3 GB | current

moe | Legacy

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest

01.AI Yi 34B Chat

34B | 200K ctx | 19 GB | legacy

dense | Legacy

MaziyarPanahi gemma 3 27b it

27B | 0K ctx | 15.1 GB

dense | Legacy

MaziyarPanahi Yi Coder 9B Chat

9B | 0K ctx | 5 GB

dense | Legacy

Bartowski glm 4 9b chat 1m

9B | 0K ctx | 5 GB

dense | Legacy

TeichAI Qwen3 8B DeepSeek v3.2 Speciale Distill

8B | 0K ctx | 4.5 GB

dense | Legacy

OpenAI GPT-OSS 20B

21B (3.6B active) | 128K ctx | 11.8 GB | frontier

moe | Legacy

GPT-OSS 20B is OpenAI's first open-weight model, a 21B-parameter mixture-of-experts model with 3.6B active parameters per token. Features configurable reasoning effort (low/medium/high), full chain-of-thought visibility, and agentic capabilities including function calling. Runs on devices with 16GB of memory using MXFP4 quantization.

InternLM InternLM 20B

20B | 8K ctx | 11.2 GB | legacy

dense | Legacy

InternLM2.5 includes an open-source 20-billion-parameter base model and a chat model tailored for practical scenarios.

Mistral Mistral Small 3.2 24B

24B | 131K ctx | 13.4 GB | current

vision | Legacy

Mistral-Small-3.2-24B-Instruct-2506 is a minor update of Mistral-Small-3.1-24B-Instruct-2503.

Alibaba Qwen 2.5 32B

32B | 131K ctx | 17.9 GB | current

dense | Legacy

Qwen2.5 is the latest series of Qwen large language models, released as a set of base and instruction-tuned language models ranging from 0.5 to 72 billion parameters.

NousResearch Hermes 3 Llama 3.1 8B

8B | 0K ctx | 4.5 GB

dense | Legacy

SanctumAI Mistral 7B Instruct v0.3

7B | 0K ctx | 3.9 GB

dense | Legacy

Stabilityai stablelm 2 zephyr 1 6b

6B | 0K ctx | 3.4 GB

dense | Legacy

NousResearch Hermes 2 Pro Mistral 7B

7B | 0K ctx | 3.9 GB

dense | Legacy

MaziyarPanahi mistral small 3.1 24b instruct 2503 hf

24B | 0K ctx | 13.4 GB

dense | Legacy

TheBloke TinyLlama 1.1B Chat v0.3

1.1B | 0K ctx | 0.6 GB

dense | Legacy

Bartowski cognitivecomputations Dolphin3.0 R1 Mistral 24B

24B | 0K ctx | 13.4 GB

dense | Legacy

Cohere Aya Expanse 8B

8B | 8K ctx | 4.5 GB | current

dense | Legacy

Aya Expanse 8B is Cohere's multilingual model supporting 23 languages with strong cross-lingual transfer. Designed for global applications requiring high-quality generation across diverse languages.

Google Gemma 2 27B

27B | 8K ctx | 15.1 GB | current

dense | Legacy

Gemma 2 27B is Google's largest Gemma 2 model, offering state-of-the-art performance among open models of similar size. Built on Gemini technology with strong reasoning, code, and multilingual capabilities.

Mistral Ministral 3 14B

14B | 262K ctx | 7.8 GB | frontier

multimodal | Legacy

The largest model in the Ministral 3 family, Ministral 3 14B offers frontier capabilities and performance comparable to its larger Mistral Small 3.2 24B counterpart. A powerful and efficient language model with vision capabilities.

Mistral Mistral Small 24B

24B | 33K ctx | 13.4 GB | legacy

dense | Legacy

Mistral Small 3 (2501) sets a new benchmark in the "small" Large Language Models category below 70B, boasting 24B parameters and achieving state-of-the-art capabilities comparable to larger models. This model is an instruction-fine-tuned version of the base model: Mistral-Small-24B-Base-2501.
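
A note on the size column: the listed download sizes appear to scale linearly with total parameter count, at roughly 0.56 GB per billion parameters (about 4.5 bits per weight). The catalog does not state this ratio. It is inferred from the figures listed above, so treat the constant and the helper name `estimated_size_gb` below as assumptions, not as the site's actual formula. A minimal sketch of the arithmetic:

```python
# Assumption: ratio inferred from the listed entries (e.g. 35B -> 19.6 GB),
# not documented by the catalog itself.
GB_PER_BILLION_PARAMS = 0.56


def estimated_size_gb(params_billions: float) -> float:
    """Estimate the listed size in GB from total parameters (in billions)."""
    return round(params_billions * GB_PER_BILLION_PARAMS, 1)


# Compare the estimate with a few (parameters, listed size) pairs from the listing.
listed = {35: 19.6, 24: 13.4, 47: 26.3, 8: 4.5, 1.1: 0.6}
for params, size in listed.items():
    print(f"{params}B: estimated {estimated_size_gb(params)} GB, listed {size} GB")
```

For mixture-of-experts entries, the estimate uses the total parameter count (47B for Mixtral 8x7B, 21B for GPT-OSS 20B), not the active count, since all experts must be stored.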
