FP8: Efficient model inference with 8-bit floating point numbers (baseten.co)
2 points | philipkiely | 2 years ago