-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
biomap-research/proteinglm-100b-int4
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF
Text Generation
•
Updated
•
219k
•
44
PrunaAI/TheDrummer-Smegmma-9B-v1-bnb-8bit-smashed
Updated
•
8
•
1
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation
•
Updated
•
207k
•
19
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF
Text Generation
•
Updated
•
206k
•
39
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
8.36k
•
16
MaziyarPanahi/Mistral-Large-Instruct-2407-GGUF
Text Generation
•
Updated
•
18.3k
•
21
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a8
Text Generation
•
Updated
•
29.3k
•
20
RedHatAI/Meta-Llama-3.1-8B-quantized.w8a8
Text Generation
•
Updated
•
408
•
4
shuyuej/Mistral-Nemo-Instruct-2407-GPTQ-INT8
Updated
•
52
•
2
mlconvexai/jais-13b-chat_bitsandbytes_8bit
Text Generation
•
Updated
•
23
•
1
LoneStriker/Hermes-3-Llama-3.1-8B-8.0bpw-h8-exl2
Updated
•
5
•
1
johnsnowlabs/JSL-MedLlama-3-8B-v17-8bits
Updated
•
11
•
1
MaziyarPanahi/Phi-3.5-mini-instruct-GGUF
Text Generation
•
Updated
•
207k
•
12
Statuo/NemoMix-Unleashed-EXL2-8bpw
Text Generation
•
Updated
•
102
•
5
speakleash/Bielik-11B-v2.2-Instruct-Quanto-8bit
Text Generation
•
Updated
•
1
•
4
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
1.8k
•
29
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
759
•
14
watsonchua/hansard-gemma-2-9b-lora
illuin/llama-3-grouse
Text Generation
•
Updated
•
1
MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF
Text Generation
•
Updated
•
210k
•
7
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF
Text Generation
•
Updated
•
182k
•
5
qeternity/Mistral-Large-Instruct-2407-w8a8
Updated
•
5
•
1
HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens
Text Generation
•
Updated
•
35
•
10
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
Updated
•
2.52k
•
181
MaziyarPanahi/reader-lm-0.5b-GGUF
Text Generation
•
Updated
•
102
•
3
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
•
Updated
•
204k
•
25
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
Updated
•
650
•
11
sanjaymk/jain_project_custom_llama
Text Generation
•
Updated
•
2
•
1
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8
Text Generation
•
Updated
•
5.41k
•
24