H3: Language Modeling with State Space Models and (Almost) No Attention (hazyresearch.stanford.edu)
1 point | anewhnaccount2 | 3 years ago