Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1

Julien Simon's avatar

About a year ago, we showed you how to distribute the training of Hugging Face transformers on a cluster or third-generation Intel Xeon Scalable CPUs (aka Ice Lake). Recently, Intel has launched the fourth generation of Xeon CPUs, code-named Sapphire Rapids, with exciting new instructions that speed up operations commonly found in deep learning models.

In this post, you

 

 

 

To finish reading, please visit source site