Accelerate BERT inference with Hugging Face Transformers and AWS Inferentia

Philipp Schmid's avatar

notebook: sagemaker/18_inferentia_inference

The adoption of BERT and Transformers continues to grow. Transformer-based models are now not only achieving state-of-the-art performance in Natural Language Processing but also for Computer Vision, Speech, and Time-Series. 💬 🖼 🎤 ⏳

Companies are now slowly moving from the experimentation and research phase to the production phase in

 

 

 

To finish reading, please visit source site