Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs (arxiv.org)
230 points by cpldcpu a year ago