Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery [pdf]research.nvidia.com2 pointsgmays5 months ago