Quartet: Native FP4 Training Can Be Optimal for Large Language Models
R. L. Castro, A. Panferov, S. Tabesh, O. Sieberling, J. Chen, M. Nikdan, S. Ashkboos, D. Alistarh. (2025). "Quartet: Native FP4 Training Can Be Optimal for Large Language Models." arXiv preprint arXiv:2505.14669.