Quantized Llama models with increased speed and a reduced memory footprint

Heykuki News

508 points

2 years ago

122 comments