MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Heykuki News

MegaBlocks: Efficient Sparse Training with Mixture-of-Experts

github.com/stanford-futuredata

6 points

3 years ago

1 comment

Threaded

Loading comments...