Nvidia: Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery [pdf]research.nvidia.com4 pointstosh5 months ago