LLMs: Speeding up ALiBi by 3-5x with a hardware-efficient implementationpli.princeton.edu2 pointseitanturok2 years ago