Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usagehuggingface.co4 pointscubie2 years ago