Code for Scaling Transformer to 1M tokens and beyond with RMT (arxiv.org)github.com/booydar1 pointmysterybox3 years ago