Repurposed Nvidia RT Cores for LLM routing (218x speedup)github.com/JordiSilvestre2 pointsJordisilvestre2 months ago