Browse AI Models

328 models available

/

Status:

Sort:

14B128K ctx7.8 GBcurrent

denseLegacy

The Phi-3-Medium-128K-Instruct is a 14B parameters, lightweight, state-of-the-art open model trained with the Phi-3 datasets that includes both synthetic data and the filtered publicly available websites data with a focus on high-quality and reasoning dense properties. The model belongs to the Phi-3 family with the Medium version in two variants 4k and 128K which is the context length (in tokens) that it can support.

Microsoft Phi-4 14B

14B16K ctx7.8 GBcurrent

denseLegacy

Our training data is an extension of the data used for Phi-3 and includes a wide variety of sources from:

Alibaba Qwen 2.5 VL 7B

7B33K ctx3.9 GBcurrent

denseLegacy

license: apache-2.0 language: - en pipeline_tag: image-text-to-text tags: - multimodal library_name: transformers

Instinct AI Solar 7B

7B8K ctx3.9 GBlegacy

denseLegacy

Solar 7B is Upstage's efficient language model built on a depth-upscaled architecture. Offers strong instruction following and reasoning performance optimized for single-GPU inference.

Defog SQLCoder 7B

7B8K ctx3.9 GBcurrent

denseLegacy

The model weights were updated at 7 AM UTC on Feb 7, 2024. The new model weights lead to a much more performant model – particularly for joins.

Bingsu exaone 3.0 7.8b it

7.8B0K ctx4.4 GB

denseLegacy

Mradermacher aya expanse 8b orthogonal heretic i1

8B0K ctx4.5 GB

denseLegacy

Bartowski Falcon3 1B Instruct abliterated

1B0K ctx0.6 GB

denseLegacy

Bartowski starcoder2 15b instruct v0.1

15B0K ctx8.4 GB

denseLegacy

Mradermacher starcoder2 15b i1

15B0K ctx8.4 GB

denseLegacy

Alibaba Qwen 2.5 14B

14B131K ctx7.8 GBcurrent

denseLegacy

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2:

Srs6901 GGUF SOLARized GraniStral 14B 2102 YeAM HCT 32QKV

14B0K ctx7.8 GB

denseLegacy

Baichuan-inc Baichuan M2 32B Q4 K M

32B0K ctx17.9 GB

denseLegacy

Mradermacher Yi 9B Coder i1

9B0K ctx5 GB

denseLegacy

Mradermacher solar finalised finetuned Model 10.7B i1

10.7B0K ctx6 GB

denseLegacy

Afrideva stablelm 3b 4e1t

3B0K ctx1.7 GB

denseLegacy

Yixman cognitivecomputations Dolphin Mistral 24B Venice Edition

24B0K ctx13.4 GB

denseLegacy

Gabriellarson Mamba Codestral 7B v0.1

7B0K ctx3.9 GB

denseLegacy

Bartowski ai21labs AI21 Jamba Reasoning 3B

3B0K ctx1.7 GB

denseLegacy

Lmstudio-community starcoder2 15b instruct v0.1

15B0K ctx8.4 GB

denseLegacy

Mradermacher Baichuan M3 235B

235B0K ctx131.6 GB

denseLegacy

RichardErkhov stabilityai japanese stablelm base gamma 7b

7B0K ctx3.9 GB

denseLegacy

Tsinghua/Zhipu CodeGeeX 4 9B

9B131K ctx5 GBcurrent

denseLegacy

We introduce CodeGeeX4-ALL-9B, the open-source version of the latest CodeGeeX4 model series. It is a multilingual code generation model continually trained on the GLM-4-9B, significantly enhancing its code generation capabilities. Using a single CodeGeeX4-ALL-9B model, it can support comprehensive functions such as code completion and generation, code interpreter, web search, function call, repository-level code Q&A, covering various scenarios of software development. CodeGeeX4-ALL-9B has achieved highly competitive performance on public benchmarks, such as BigCodeBench and NaturalCodeBench.

Mistral Mistral Nemo 12B

12B128K ctx6.7 GBcurrent

denseLegacy

The Mistral-Nemo-Instruct-2407 Large Language Model (LLM) is an instruct fine-tuned version of the Mistral-Nemo-Base-2407. Trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.

Browse AI Models

328 models available

/

Status:

Sort:

Microsoft Phi 3 Medium 14B

14B128K ctx7.8 GBcurrent

denseLegacy

Microsoft Phi-4 14B

14B16K ctx7.8 GBcurrent

denseLegacy

Our training data is an extension of the data used for Phi-3 and includes a wide variety of sources from:

Alibaba Qwen 2.5 VL 7B

7B33K ctx3.9 GBcurrent

denseLegacy

license: apache-2.0 language: - en pipeline_tag: image-text-to-text tags: - multimodal library_name: transformers

Instinct AI Solar 7B

7B8K ctx3.9 GBlegacy

denseLegacy

Solar 7B is Upstage's efficient language model built on a depth-upscaled architecture. Offers strong instruction following and reasoning performance optimized for single-GPU inference.

Defog SQLCoder 7B

7B8K ctx3.9 GBcurrent

denseLegacy

The model weights were updated at 7 AM UTC on Feb 7, 2024. The new model weights lead to a much more performant model – particularly for joins.

Bingsu exaone 3.0 7.8b it

7.8B0K ctx4.4 GB

denseLegacy

Mradermacher aya expanse 8b orthogonal heretic i1

8B0K ctx4.5 GB

denseLegacy

Bartowski Falcon3 1B Instruct abliterated

1B0K ctx0.6 GB