LayerSkip: Enabling Early Exit Inference and Self-Speculative Decodinggithub.com/facebookresearch1 pointzerojames2 years ago