HK

KVarN: Native vLLM backend for KV-cache quantization by Huawei | Heykuki News