New algorithm can route sequences with 1M+ token embeddings in one GPUgithub.com/glassroom21 pointscs7024 years ago