Think Smart About Sparse Compute: LatentMoE for Higher Accuracy per Flop, Paramresearch.nvidia.com2 pointsbuildbot5 months ago