Model Card for Model ID

Phi-4 unsloth model trained to generate deepseek-r1 styled reasoning based on a system prompt "detailed thinking on"

Model Details

Model Description

This fine-tuned model generated ehanced chain-of-thoughts, reasoning and produce "Aha-moments" akin to deepseek whenever the system prompt is set to "detailed thinking on". Test any questions from trending datasets about code, math and science, with the system prompt set and unset and you can clearly see the difference in the generated output. A Modelfile is included with the gguf files that can be used to load the model into Ollama. You have to set the system prompt manually after loading the model in Ollama since by default there is no system prompt. You can use /set SYSTEM "detailed thinking on" and then input your prompt. The Modelfile includes optimal parameters but you can experimment with different set of parameters based on your desired goal/output.

Uses

Tasks requiring reasoning, chain-of-thoughts, several approaches etc...

Recommendations

Use the parameters in the Modefile and set system prompt to "detailed thinking on" whenever you require long reasoning outputs. Set system parameter off when you want a direct to the point quick answer without reasoning chains. It is important to utilize the chat-template embedded in the Modelfile to ensure optimal generations and avoid endless generations or loops.

How to Get Started with the Model

Download the gguf file and Modelfile into the same folder then use ollama create phi4-deepseek -f Modelfile. Then run the model using ollama run. Set the system parameter Finally, start prompting.

Training Details

Training Data

Nvidia datasets containing reasoning context produced by DeeepSeek-R1.

Training Procedure

Unsloth SFT Trainer

Downloads last month
262
GGUF
Model size
14.7B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ykarout/phi4-deepseek-r1-distilled-gguf-v5

Base model

microsoft/phi-4
Quantized
(125)
this model

Dataset used to train ykarout/phi4-deepseek-r1-distilled-gguf-v5