Mixed Precision Training: Faster Deep Learning Without Losing Accuracy
Training today’s deep learning models is resource-hungry. Models have billions of parameters, and every step requires trillions of floating-point operations. To make training feasible, researchers and engineers rely on mixed…