Show HN: TurboPrefill – Multi-GPU prefill acceleration for llama.cppgithub.com/sergey-automation2 pointstrykhlieb19 days agoTurboPrefill is an attempt to make layer-split multi-GPU configurations spend less time waiting and more time computing during prefill.