Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention (magazine.sebastianraschka.com)
3 points | pretext | a month ago