Exllamav2: Inference library for running LLMs locally on consumer-class GPUsgithub.com/turboderp322 pointsPalmik3 years ago