Edit Models filters

Inference Providers

Nebius AI Studio

HF Inference API

Misc

8-bit precision

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

text-embeddings-inference

Carbon Emissions

Models

27,744

Full-text search

Active filters: 8-bit

biomap-research/proteinglm-100b-int4

Updated Mar 17 • 263 • 8

MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF

Text Generation • Updated Jul 22, 2024 • 219k • 44

PrunaAI/TheDrummer-Smegmma-9B-v1-bnb-8bit-smashed

Updated Jul 21, 2024 • 8 • 1

MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF

Text Generation • Updated Jul 23, 2024 • 207k • 19

MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF

Text Generation • Updated Jul 29, 2024 • 206k • 39

RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 8.36k • 16

MaziyarPanahi/Mistral-Large-Instruct-2407-GGUF

Text Generation • Updated Jul 26, 2024 • 18.3k • 21

RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a8

Text Generation • Updated Feb 11 • 29.3k • 20

RedHatAI/Meta-Llama-3.1-8B-quantized.w8a8

Text Generation • Updated Oct 23, 2024 • 408 • 4

shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8

Updated Aug 7, 2024 • 52 • 2

mlconvexai/jais-13b-chat_bitsandbytes_8bit

Text Generation • Updated Oct 27, 2024 • 23 • 1

LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2

Updated Aug 15, 2024 • 5 • 1

johnsnowlabs/JSL-MedLlama-3-8B-v17-8bits

Updated Aug 16, 2024 • 11 • 1

MaziyarPanahi/Phi-3.5-mini-instruct-GGUF

Text Generation • Updated Aug 20, 2024 • 207k • 12

Statuo/NemoMix-Unleashed-EXL2-8bpw

Text Generation • Updated Aug 23, 2024 • 102 • 5

speakleash/Bielik-11B-v2.2-Instruct-Quanto-8bit

Text Generation • Updated Oct 7, 2024 • 1 • 4

Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 1.8k • 29

Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 21, 2024 • 759 • 14

watsonchua/hansard-gemma-2-9b-lora

Updated Sep 2, 2024 • 1

illuin/llama-3-grouse

Text Generation • Updated Sep 17, 2024 • 1

MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF

Text Generation • Updated Sep 4, 2024 • 210k • 7

MaziyarPanahi/Yi-Coder-9B-Chat-GGUF

Text Generation • Updated Sep 4, 2024 • 182k • 5

qeternity/Mistral-Large-Instruct-2407-w8a8

Updated Sep 7, 2024 • 5 • 1

HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens

Text Generation • Updated Sep 18, 2024 • 35 • 10

HF1BitLLM/Llama3-8B-1.58-100B-tokens

Text Generation • Updated Sep 19, 2024 • 2.52k • 181

MaziyarPanahi/reader-lm-0.5b-GGUF

Text Generation • Updated Sep 11, 2024 • 102 • 3

MaziyarPanahi/solar-pro-preview-instruct-GGUF

Text Generation • Updated Sep 13, 2024 • 204k • 25

Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8

Image-Text-to-Text • Updated Sep 24, 2024 • 650 • 11

sanjaymk/jain_project_custom_llama

Text Generation • Updated Sep 17, 2024 • 2 • 1

Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 5.41k • 24