Full GPU Inference of LLaMA on Apple Silicon Using Metal (github.com/ggerganov)
4 points by behnamoh 3 years ago