QUIK is a method for quantizing LLM post-training weights to 4 bit precisiongithub.com/IST-DASLab85 pointsanigbrowl3 years ago