Browse AI Models

283 models available

Meta Llama 4 Maverick 17B 128E

400B (17B active) · 1.0M ctx · 224 GB · frontier

moe · Legacy

Llama 4 Maverick is Meta's large MoE model with 17B active parameters and 128 experts (400B total). Delivers frontier-class performance on reasoning and coding while remaining deployable on a single node.

Qwen Qwen2.5 3B Instruct

3B · 0K ctx · 1.7 GB

dense · Legacy

Qwen Qwen2.5 1.5B Instruct

1.5B · 0K ctx · 0.8 GB

dense · Legacy

Ggml-org SmolVLM 500M Instruct

0.5B · 0K ctx · 0.3 GB

dense · Legacy

TheBloke Mistral 7B Instruct v0.2

7B · 0K ctx · 3.9 GB

dense · Legacy

Unsloth DeepSeek R1 0528 Qwen3 8B

8B · 0K ctx · 4.5 GB

dense · Legacy

TheBloke TinyLlama 1.1B Chat v1.0

1.1B · 0K ctx · 0.6 GB

dense · Legacy

Cohere Command A 111B

111B · 262K ctx · 62.2 GB · frontier

dense · Legacy

Command A is Cohere's latest flagship model with 111B parameters, designed for agentic enterprise applications. Features advanced tool use, multi-step reasoning, and retrieval-augmented generation.

Alibaba Qwen 2.5 VL 72B

72B · 33K ctx · 40.3 GB · frontier

dense · Legacy

License: qwen (other), https://huggingface.co/Qwen/Qwen2.5-VL-72B-Instruct/blob/main/LICENSE. Language: en. Pipeline tag: image-text-to-text. Tags: multimodal. Library: transformers.

Unsloth gemma 3 27b it

27B · 0K ctx · 15.1 GB

dense · Legacy

TheDrummer Gemmasutra Mini 2B v1

2B · 0K ctx · 1.1 GB

dense · Legacy

Lmstudio-community Qwen3.5 9B

9B · 0K ctx · 5 GB

dense · Legacy

MaziyarPanahi Mistral 7B Instruct v0.3

7B · 0K ctx · 3.9 GB

dense · Legacy

Lmstudio-community gemma 3 4b it

4B · 0K ctx · 2.2 GB

dense · Legacy

Ggml-org embeddinggemma 300M

0.3B · 0K ctx · 0.2 GB

dense · Legacy

MaziyarPanahi Meta Llama 3 8B Instruct

8B · 0K ctx · 4.5 GB

dense · Legacy

Lmstudio-community Qwen3.5 35B A3B

35B · 0K ctx · 19.6 GB

dense · Legacy

Meta Llama 3.1 70B

70B · 128K ctx · 39.2 GB · legacy

dense · Legacy

Llama 3.1 70B is Meta's high-capability open model with 128K context window. Excels at complex reasoning, multilingual tasks, code generation, and tool use with quality competitive with leading proprietary models.

NVIDIA Nemotron 70B

70B · 131K ctx · 39.2 GB · current

dense · Legacy

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM-generated responses to user queries.

Mistral AI Pixtral Large 124B

124B · 131K ctx · 69.4 GB · frontier

dense · Legacy

Pixtral-Large-Instruct-2411 is a 124B multimodal model built on top of Mistral Large 2 (Mistral-Large-Instruct-2407). It is the second model in Mistral AI's multimodal family and demonstrates frontier-level image understanding. In particular, it can understand documents, charts, and natural images while maintaining the text-only understanding of Mistral Large 2.

Unsloth DeepSeek R1 Distill Llama 8B

8B · 0K ctx · 4.5 GB

dense · Legacy

Dphn Dolphin3.0 Llama3.1 8B

8B · 0K ctx · 4.5 GB

dense · Legacy

MaziyarPanahi Llama 3 8B Instruct 32k v0.1

8B · 0K ctx · 4.5 GB

dense · Legacy

Unsloth DeepSeek R1 Distill Qwen 1.5B

1.5B · 0K ctx · 0.8 GB

dense · Legacy
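
The sizes listed on the cards above track the parameter counts closely: each one is about 0.56 GB per billion parameters, which works out to roughly 4.5 bits per weight. The page does not say how the sizes are computed, so the 0.56 factor below is an assumption inferred from the listed numbers, and the helper names are hypothetical. A minimal Python sketch of that arithmetic, using a few cards as samples:

```python
# (parameters in billions, listed size in GB), copied from the cards above.
LISTED = {
    "Llama 4 Maverick 17B 128E": (400, 224.0),
    "Command A 111B": (111, 62.2),
    "Qwen 2.5 VL 72B": (72, 40.3),
    "Llama 3.1 70B": (70, 39.2),
    "Mistral 7B Instruct v0.2": (7, 3.9),
    "Qwen2.5 3B Instruct": (3, 1.7),
}

# Assumption: inferred from the listed numbers, not documented by the catalog.
GB_PER_BILLION = 0.56


def estimate_size_gb(params_billion: float) -> float:
    """Estimate the listed size from the total parameter count."""
    return round(params_billion * GB_PER_BILLION, 1)


def bits_per_weight(params_billion: float, size_gb: float) -> float:
    """Implied average bits per parameter, taking 1 GB as 1e9 bytes."""
    return size_gb * 8 / params_billion


for name, (params, size) in LISTED.items():
    print(
        f"{name}: listed {size} GB, "
        f"estimated {estimate_size_gb(params)} GB, "
        f"~{bits_per_weight(params, size):.2f} bits/weight"
    )
```

For the MoE entry the estimate uses the total parameter count (400B), not the 17B active parameters, since all experts have to be stored even though only some are used per token.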
