HK

AutoMegaKernel: Compile an LLM into one provably-correct CUDA megakernel | Heykuki News