Intel reduces latencies of chat LLM app using quantisationcommunity.intel.com5 pointsmariarmestre2 years ago