Browse AI Models

283 models available

/

Status:

Sort:

Filtered by:

Nous Research Nous Dolphin 13B

13B16K ctx7.3 GBlegacy

denseLegacy

Dolphin 13B is a general-purpose uncensored model fine-tuned for broad capabilities including coding, reasoning, and creative writing without alignment restrictions.

LMSYS Vicuna 13B

13B4K ctx7.3 GBlegacy

denseLegacy

Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

WizardLM WizardLM 13B

13B8K ctx7.3 GBlegacy

denseLegacy

Project Repo: https://github.com/nlpxucan/WizardLM

Mradermacher HelpingAI 3B hindi

3B0K ctx1.7 GB

denseLegacy

Mradermacher zephyr 7b gemma sft african ultrachat 100k

7B0K ctx3.9 GB

denseLegacy

Mradermacher HelpingAI 9B i1

9B0K ctx5 GB

denseLegacy

RichardErkhov jointpreferences mistral 7b sft helpful

7B0K ctx3.9 GB

denseLegacy

Mradermacher zephyr 7b dpo full i1

7B0K ctx3.9 GB

denseLegacy

Mradermacher blossom v3 baichuan2 7b i1

7B0K ctx3.9 GB

denseLegacy

Mradermacher Helply 10.2b chat i1

10.2B0K ctx5.7 GB

denseLegacy

Mradermacher AI21 Jamba2 3B i1

3B0K ctx1.7 GB

denseLegacy

Mradermacher blossom v1 baichuan 7b i1

7B0K ctx3.9 GB

denseLegacy

Mistral Ministral 8B

8B131K ctx4.5 GBcurrent

denseLegacy

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

Mradermacher BaichuanMed OCR 72B i1

72B0K ctx40.3 GB

denseLegacy

Google Gemma 2 9B

9B8K ctx5 GBcurrent

denseLegacy

Gemma 2 9B is Google's mid-size open model built on Gemini research. Features improved reasoning and safety with a novel architecture optimized for efficient inference on consumer hardware.

IBM Granite 3.1 8B

8B128K ctx4.5 GBcurrent

denseLegacy

Model Summary: Granite-3.1-8B-Instruct is a 8B parameter long-context instruct model finetuned from Granite-3.1-8B-Base using a combination of open source instruction datasets with permissive license and internally collected synthetic datasets tailored for solving long context problems. This model is developed using a diverse set of techniques with a structured chat format, including supervised finetuning, model alignment using reinforcement learning, and model merging.

LLaVA LLaVA 1.5 7B

7B4K ctx3.9 GBlegacy

denseLegacy

Model type: LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture.

Cognitive Computations Samantha 7B

7B4K ctx3.9 GBlegacy

denseLegacy

Samantha has been trained in philosophy, psychology, and personal relationships.

01.AI Yi 1.5 9B

9B4K ctx5 GBcurrent

denseLegacy

🐙 GitHub • 👾 Discord • 🐤 Twitter • 💬 WeChat

Cerebras Cerebras-GPT 13B

13B131K ctx7.3 GBlegacy

denseLegacy

Check out our Blog Post and arXiv paper!

Mistral Ministral 3 3B

3B262K ctx1.7 GBfrontier

multimodalLegacy

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

Microsoft Phi 4 Mini 4B

4B128K ctx2.2 GBfrontier

denseLegacy

Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.

Alibaba Qwen 2.5 7B

7B131K ctx3.9 GBcurrent

denseLegacy

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:

MosaicML MPT-7B-Instruct

7B8K ctx3.9 GBlegacy

denseLegacy

MPT-7B Instruct is MosaicML's instruction-tuned model with a commercially permissive license. Supports 65K context with ALiBi positional encoding for efficient long-document processing.

Browse AI Models

283 models available

/

Status:

Sort:

Filtered by:

Nous Research Nous Dolphin 13B

13B16K ctx7.3 GBlegacy

denseLegacy

Dolphin 13B is a general-purpose uncensored model fine-tuned for broad capabilities including coding, reasoning, and creative writing without alignment restrictions.

LMSYS Vicuna 13B

13B4K ctx7.3 GBlegacy

denseLegacy

Vicuna is a chat assistant trained by fine-tuning Llama 2 on user-shared conversations collected from ShareGPT.

WizardLM WizardLM 13B

13B8K ctx7.3 GBlegacy

denseLegacy

Project Repo: https://github.com/nlpxucan/WizardLM

Mradermacher HelpingAI 3B hindi

3B0K ctx1.7 GB

denseLegacy

Mradermacher zephyr 7b gemma sft african ultrachat 100k

7B0K ctx3.9 GB

denseLegacy

Mradermacher HelpingAI 9B i1

9B0K ctx5 GB

denseLegacy

RichardErkhov jointpreferences mistral 7b sft helpful

7B0K ctx3.9 GB

denseLegacy

Mradermacher zephyr 7b dpo full i1

7B0K ctx3.9 GB

denseLegacy

Mradermacher blossom v3 baichuan2 7b i1

7B0K ctx3.9 GB

denseLegacy

Mradermacher Helply 10.2b chat i1

10.2B0K ctx5.7 GB

denseLegacy

Mradermacher AI21 Jamba2 3B i1

3B0K ctx1.7 GB

denseLegacy

Mradermacher blossom v1 baichuan 7b i1

7B0K ctx3.9 GB

denseLegacy

Mistral Ministral 8B

8B131K ctx4.5 GBcurrent

denseLegacy

We introduce two new state-of-the-art models for local intelligence, on-device computing, and at-the-edge use cases. We call them les Ministraux: Ministral 3B and Ministral 8B.

Mradermacher BaichuanMed OCR 72B i1

72B0K ctx40.3 GB

denseLegacy

Google Gemma 2 9B

9B8K ctx5 GBcurrent

denseLegacy

Gemma 2 9B is Google's mid-size open model built on Gemini research. Features improved reasoning and safety with a novel architecture optimized for efficient inference on consumer hardware.

IBM Granite 3.1 8B

8B128K ctx4.5 GBcurrent

denseLegacy

LLaVA LLaVA 1.5 7B

7B4K ctx3.9 GBlegacy

denseLegacy

Cognitive Computations Samantha 7B

7B4K ctx3.9 GBlegacy

denseLegacy

Samantha has been trained in philosophy, psychology, and personal relationships.

01.AI Yi 1.5 9B

9B4K ctx5 GBcurrent

denseLegacy

🐙 GitHub • 👾 Discord • 🐤 Twitter • 💬 WeChat

Cerebras Cerebras-GPT 13B

13B131K ctx7.3 GBlegacy

denseLegacy

Check out our Blog Post and arXiv paper!

Mistral Ministral 3 3B

3B262K ctx1.7 GBfrontier

multimodalLegacy

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

Microsoft Phi 4 Mini 4B

4B128K ctx2.2 GBfrontier

denseLegacy

Alibaba Qwen 2.5 7B

7B131K ctx3.9 GBcurrent

denseLegacy

MosaicML MPT-7B-Instruct

7B8K ctx3.9 GBlegacy

denseLegacy

MPT-7B Instruct is MosaicML's instruction-tuned model with a commercially permissive license. Supports 65K context with ALiBi positional encoding for efficient long-document processing.