Spaces:

joey1101
/

intel

Build error

joey1101 commited on Mar 24

Commit

7ed7879

verified ·

1 Parent(s): 38823b1

Create LLM_low_bit_optimize.py

Files changed (1) hide show

LLM_low_bit_optimize.py ADDED Viewed

+from ipex_llm.transformers import AutoModelForCausalLM
+from transformers import LlamaTokenizer
+llm = AutoModelForCausalLM.from_pretrained("checkpoints\\Llama-2-7b-chat-hf",load_in_low_bit="sym_int4")
+llm.save_low_bit("checkpoints\\Llama-2-7b-chat-hf-INT4")
+tokenizer = LlamaTokenizer.from_pretrained("checkpoints\\Llama-2-7b-chat-hf\\")
+tokenizer.save_pretrained("checkpoints\\Llama-2-7b-chat-hf-INT4")