How can I run it with 400MB memory as was claimed in the model card ?
What are the quantisation changes to be made to make it happen ?
build bitnet.cpp
· Sign up or log in to comment