Accelerate BERT inference with Hugging Face Transformers and AWS Inferentia
notebook: sagemaker/18_inferentia_inference
The adoption of BERT and Transformers continues to grow. Transformer-based models are now not only achieving state-of-the-art performance in Natural Language Processing but also for Computer Vision, Speech, and Time-Series. 💬 🖼 🎤 ⏳
Companies are now slowly moving from the experimentation and research phase to the production phase in