Transformer language models are doing something more general – LessWrong
lesswrong.com | 15 points | bilsbie | 4 years ago