LLM Inference in pure Java with a GPU acceleration enabledgithub.com/beehive-lab3 pointsmikepapadima year ago