DeepSpeed-FastGen: High-Throughput for LLMs via MII and DeepSpeed-Inference

Heykuki News

2 points

3 years ago

No comments

Threaded

Loading comments...

DeepSpeed-FastGen: High-Throughput for LLMs via MII and DeepSpeed-Inference | Heykuki News