Models: 4,766

Active filters: dpo, trl

BramVanroy/GEITje-7B-ultra

Text Generation • Updated Dec 6, 2024 • 952 • 45

argilla/CapybaraHermes-2.5-Mistral-7B

Updated Mar 4, 2024 • 23 • 69

TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF

Updated Jan 31, 2024 • 7.3k • 111

TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ

Updated Jan 31, 2024 • 563 • 57

ENERGY-DRINK-LOVE/eeve_dpo-v3

Text Generation • Updated Mar 7, 2024 • 1.8k • 1

dmis-lab/self-biorag-7b-olaph

Text Generation • Updated May 22, 2024 • 20 • 3

tanliboy/zephyr-qwen2-7b-dpo

Text Generation • Updated Jun 20, 2024 • 11 • 1

mradermacher/zephyr-qwen2-7b-dpo-GGUF

Updated 30 days ago • 197 • 1

Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.1

Text Generation • Updated Aug 19, 2024 • 1.8k • 4

Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.2

Updated Aug 19, 2024 • 14 • 3

mradermacher/Llama-3.1-8B-Magpie-Align-v0.1-GGUF

Updated Jan 22 • 232 • 1

mradermacher/Llama-3.1-8B-Magpie-Align-v0.1-i1-GGUF

Updated Jan 22 • 353 • 1

mlabonne/TwinLlama-3.1-8B-DPO

Text Generation • Updated Oct 6, 2024 • 56 • 11

Magpie-Align/MagpieLM-4B-Chat-v0.1

Text Generation • Updated Dec 9, 2024 • 6 • 20

Magpie-Align/MagpieLM-8B-Chat-v0.1

Text Generation • Updated Dec 9, 2024 • 1.86k • 22

tanliboy/lambda-llama-3-8b-ipo-test

Text Generation • Updated Sep 21, 2024 • 14 • 1

trl-lib/Qwen2-0.5B-DPO

Text Generation • Updated Sep 27, 2024 • 35 • 5

HumanLLMs/Human-Like-LLama3-8B-Instruct

Text Generation • Updated Jan 13 • 69 • 19

HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407

Text Generation • Updated Jan 13 • 99 • 13

mradermacher/llama3-8b-dpo-GGUF

Updated Nov 5, 2024 • 80 • 1

mradermacher/Humanish-Qwen2.5-7B-Instruct-i1-GGUF

Updated Jan 11 • 543 • 3

TheDrunkenSnail/Human-Like-Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF

Updated Nov 16, 2024 • 17 • 1

mradermacher/Llama-3.1-8B-sft-SPIN-human-dataset-i1-GGUF

Updated Mar 5 • 92 • 1

mradermacher/Human-Like-LLama3-8B-Instruct-GGUF

Updated Jan 12 • 231 • 1

mradermacher/Human-Like-LLama3-8B-Instruct-i1-GGUF

Updated Jan 12 • 563 • 2

mradermacher/Human-Like-Qwen2.5-7B-Instruct-GGUF

Updated Jan 13 • 69 • 2

mradermacher/Human-Like-Mistral-Nemo-Instruct-2407-GGUF

Updated Jan 14 • 67 • 1

mradermacher/Human-Like-Qwen2.5-7B-Instruct-i1-GGUF

Updated Jan 13 • 67 • 1

robinsmits/Schaapje-2B-Chat-V1.0

Text Generation • Updated Jan 19 • 12 • 4

NikolayKozloff/Human-Like-Mistral-Nemo-Instruct-2407-Q8_0-GGUF

Text Generation • Updated Jan 16 • 2 • 1