Medusa: Simple Framework for Accelerating LLM Generationgithub.com/FasterDecoding1 pointcmitsakis3 years ago