When can you support vllm with cpu-only inference?
#27
by
bigmao2012
- opened
it lacks configuration_bitnet.py, modeling_bitnet.py, and maybe more essential files to run bitnet with vllm.
it lacks configuration_bitnet.py, modeling_bitnet.py, and maybe more essential files to run bitnet with vllm.