T-Mac: Low-bit LLM inference on CPU/NPU with lookup tablegithub.com/microsoft5 pointsnateb20229 months ago