Nvidia releases 8B model with learned 8x KV cache compression (huggingface.co)
9 points by alecco 5 months ago