-
-
-
-
-
-
Inference Providers
Active filters:
dpo, trl
BramVanroy/GEITje-7B-ultra
Text Generation
•
Updated
•
952
•
45
argilla/CapybaraHermes-2.5-Mistral-7B
Updated
•
23
•
69
TheBloke/CapybaraHermes-2.5-Mistral-7B-GGUF
Updated
•
7.3k
•
111
TheBloke/CapybaraHermes-2.5-Mistral-7B-GPTQ
Updated
•
563
•
57
ENERGY-DRINK-LOVE/eeve_dpo-v3
Text Generation
•
Updated
•
1.8k
•
1
dmis-lab/self-biorag-7b-olaph
Text Generation
•
Updated
•
20
•
3
tanliboy/zephyr-qwen2-7b-dpo
Text Generation
•
Updated
•
11
•
1
mradermacher/zephyr-qwen2-7b-dpo-GGUF
Updated
•
197
•
1
Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.1
Text Generation
•
Updated
•
1.8k
•
4
Magpie-Align/Llama-3.1-8B-Magpie-Align-v0.2
Updated
•
14
•
3
mradermacher/Llama-3.1-8B-Magpie-Align-v0.1-GGUF
mradermacher/Llama-3.1-8B-Magpie-Align-v0.1-i1-GGUF
mlabonne/TwinLlama-3.1-8B-DPO
Text Generation
•
Updated
•
56
•
11
Magpie-Align/MagpieLM-4B-Chat-v0.1
Text Generation
•
Updated
•
6
•
20
Magpie-Align/MagpieLM-8B-Chat-v0.1
Text Generation
•
Updated
•
1.86k
•
22
tanliboy/lambda-llama-3-8b-ipo-test
Text Generation
•
Updated
•
14
•
1
trl-lib/Qwen2-0.5B-DPO
Text Generation
•
Updated
•
35
•
5
HumanLLMs/Human-Like-LLama3-8B-Instruct
Text Generation
•
Updated
•
69
•
19
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
Text Generation
•
Updated
•
99
•
13
mradermacher/llama3-8b-dpo-GGUF
Updated
•
80
•
1
mradermacher/Humanish-Qwen2.5-7B-Instruct-i1-GGUF
TheDrunkenSnail/Human-Like-Mistral-Nemo-Instruct-2407-Q4_K_M-GGUF
Updated
•
17
•
1
mradermacher/Llama-3.1-8B-sft-SPIN-human-dataset-i1-GGUF
mradermacher/Human-Like-LLama3-8B-Instruct-GGUF
mradermacher/Human-Like-LLama3-8B-Instruct-i1-GGUF
mradermacher/Human-Like-Qwen2.5-7B-Instruct-GGUF
mradermacher/Human-Like-Mistral-Nemo-Instruct-2407-GGUF
mradermacher/Human-Like-Qwen2.5-7B-Instruct-i1-GGUF
robinsmits/Schaapje-2B-Chat-V1.0
Text Generation
•
Updated
•
12
•
4
NikolayKozloff/Human-Like-Mistral-Nemo-Instruct-2407-Q8_0-GGUF
Text Generation
•
Updated
•
2
•
1