Running large language models like ChatGPT on a single GPUgithub.com/Ying1123682 points_nhynes3 years ago