Shimmy v1.7.0: Running 42B Moe Models on Consumer GPUs with 99.9% VRAM Reductiongithub.com/Michael-A-Kuykendall3 pointsMKuykendall8 months ago