Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face
Intel and Hugging Face collaborated to demonstrate the real-world value of upgrading to Google’s latest
C4 Virtual Machine (VM) running on Intel® Xeon® 6 processors (codenamed Granite Rapids (GNR)). We specifically wanted to benchmark improvements in the text generation performance of OpenAI GPT OSS Large Language Model(LLM).
The results are in, and they are impressive, demonstrating a 1.7x improvement in Total Cost of Ownership(TCO) over the previous-generation Google C3 VM instances. The Google Cloud C4 VM instance further resulted in:
- 1.4x to 1.7x TPOT throughput/vCPU/dollar
- Lower price per hour over
C3VM