‌‌‌When can you support vllm with cpu-only inference?

#27
by bigmao2012 - opened

it lacks configuration_bitnet.py, modeling_bitnet.py, and maybe more essential files to run bitnet with vllm.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment