Computational Bottlenecks of Training Small-Scale Large Language Modelsmachinelearning.apple.com2 pointssandwichsphinx2 years ago