Xinity Runtime: Apache 2.0 LLM inference engine for on-premise deploymentgithub.com/xinity-ai3 pointsxinity3 months ago