Browse AI Models

328 models available

/

Status:

Sort:

14.7B33K ctx8.2 GBfrontier

denseLegacy

> [!IMPORTANT] > To fully take advantage of the model's capabilities, inference must use `temperature=0.8`, `top_k=50`, `top_p=0.95`, and `do_sample=True`. For more complex queries, set `max_new_tokens=32768` to allow for longer chain-of-thought (CoT).

Alibaba Qwen 3 32B

32B131K ctx17.9 GBfrontier

denseLegacy

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support, with the following key features:

MaziyarPanahi Yi Coder 1.5B Chat

1.5B0K ctx0.8 GB

denseLegacy

MaziyarPanahi gemma 3 12b it

12B0K ctx6.7 GB

denseLegacy

MaziyarPanahi Llama 3.2 1B Instruct

1B0K ctx0.6 GB

denseLegacy

MaziyarPanahi gemma 2 2b it

2B0K ctx1.1 GB

denseLegacy

MaziyarPanahi Llama 3.2 3B Instruct

3B0K ctx1.7 GB

denseLegacy

MaziyarPanahi gemma 3 1b it

1B0K ctx0.6 GB

denseLegacy

Bartowski cognitivecomputations Dolphin Mistral 24B Venice Edition

24B0K ctx13.4 GB

denseLegacy

MaziyarPanahi DeepSeek R1 0528 Qwen3 8B

8B0K ctx4.5 GB

denseLegacy

MaziyarPanahi Mistral Small 24B Instruct 2501

24B0K ctx13.4 GB

denseLegacy

MaziyarPanahi Yi 1.5 6B Chat

6B0K ctx3.4 GB

denseLegacy

Mistralai Ministral 3 3B Instruct 2512

3B0K ctx1.7 GB

denseLegacy

Cohere Command R 35B

35B131K ctx19.6 GBcurrent

denseLegacy

Command R is Cohere's retrieval-augmented generation model optimized for enterprise use. Excels at long-context document processing, tool use, and grounded generation with citation support.

Jina AI Jina Embeddings v3

0.57B8K ctx0.3 GBcurrent

denseLegacy

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

Meta Llama 3.2 11B Vision

11B16K ctx6.2 GBlegacy

visionLegacy

Llama 3.2 11B Vision is Meta's multimodal model that processes both text and images. Supports visual question answering, image captioning, and document understanding alongside standard text generation.

Mistral Magistral Small 2507

24B131K ctx13.4 GBlegacy

denseLegacy

Building upon Mistral Small 3.1 (2503), with added reasoning capabilities, undergoing SFT from Magistral Medium traces and RL on top, it's a small, efficient reasoning model with 24B parameters.

Mistral Mixtral 8x7B

47B (13B active)33K ctx26.3 GBcurrent

moeLegacy

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer from mistral_common.protocol.instruct.messages import UserMessage from mistral_common.protocol.instruct.request import ChatCompletionRequest

01.AI Yi 34B Chat

34B200K ctx19 GBlegacy

denseLegacy

- they might want nothing more than destruction itself rather then anything else from their quest after immortality (and maybe someone should tell them about modern medicine)? In any event though – one thing remains true regardless : whether or not success comes easy depends entirely upon how much effort we put into conquering whatever challenges lie ahead along with having faith deep down inside ourselves too ;) So let’s get started now shall We?" pipeline_tag: text-generation

MaziyarPanahi gemma 3 27b it

27B0K ctx15.1 GB

denseLegacy

MaziyarPanahi Yi Coder 9B Chat

9B0K ctx5 GB

denseLegacy

Bartowski glm 4 9b chat 1m

9B0K ctx5 GB

denseLegacy

TeichAI Qwen3 8B DeepSeek v3.2 Speciale Distill

8B0K ctx4.5 GB

denseLegacy

BAAI BGE M3

0.57B8K ctx0.3 GBcurrent

denseLegacy

For more details please refer to our github repo: https://github.com/FlagOpen/FlagEmbedding

Browse AI Models

328 models available

/

Status:

Sort:

Microsoft Phi-4-reasoning-plus 14B

14.7B33K ctx8.2 GBfrontier

denseLegacy

Alibaba Qwen 3 32B

32B131K ctx17.9 GBfrontier

denseLegacy

MaziyarPanahi Yi Coder 1.5B Chat

1.5B0K ctx0.8 GB

denseLegacy

MaziyarPanahi gemma 3 12b it

12B0K ctx6.7 GB

denseLegacy

MaziyarPanahi Llama 3.2 1B Instruct

1B0K ctx0.6 GB

denseLegacy

MaziyarPanahi gemma 2 2b it

2B0K ctx1.1 GB

denseLegacy

MaziyarPanahi Llama 3.2 3B Instruct

3B0K ctx1.7 GB

denseLegacy

MaziyarPanahi gemma 3 1b it

1B0K ctx0.6 GB

denseLegacy

Bartowski cognitivecomputations Dolphin Mistral 24B Venice Edition

24B0K ctx13.4 GB

denseLegacy

MaziyarPanahi DeepSeek R1 0528 Qwen3 8B

8B0K ctx4.5 GB

denseLegacy

MaziyarPanahi Mistral Small 24B Instruct 2501

24B0K ctx13.4 GB

denseLegacy

MaziyarPanahi Yi 1.5 6B Chat

6B0K ctx3.4 GB

denseLegacy

Mistralai Ministral 3 3B Instruct 2512

3B0K ctx1.7 GB

denseLegacy

Cohere Command R 35B

35B131K ctx19.6 GBcurrent

denseLegacy

Command R is Cohere's retrieval-augmented generation model optimized for enterprise use. Excels at long-context document processing, tool use, and grounded generation with citation support.

Jina AI Jina Embeddings v3

0.57B8K ctx0.3 GBcurrent

denseLegacy

jina-embeddings-v3: Multilingual Embeddings With Task LoRA

Meta Llama 3.2 11B Vision

11B16K ctx6.2 GBlegacy

visionLegacy

Mistral Magistral Small 2507

24B131K ctx13.4 GBlegacy

denseLegacy

Building upon Mistral Small 3.1 (2503), with added reasoning capabilities, undergoing SFT from Magistral Medium traces and RL on top, it's a small, efficient reasoning model with 24B parameters.

Mistral Mixtral 8x7B

47B (13B active)33K ctx26.3 GBcurrent

moeLegacy

01.AI Yi 34B Chat

34B200K ctx19 GBlegacy

denseLegacy

MaziyarPanahi gemma 3 27b it

27B0K ctx15.1 GB

denseLegacy

MaziyarPanahi Yi Coder 9B Chat

9B0K ctx5 GB

denseLegacy