llama.cpp fixes to run Bonsai 1-bit models on CPU (incl. AVX512) and AMD GPUs | hypedar