NVIDIA’s New Turing T4 GPU is going gangbusters in the Cloud Space


Earlier this week, NVIDIA announced that the new NVIDIA T4 GPU is now the biggest-selling GPU in the cloud space. Two months after its introduction, the T4 is featured in 57 separate server designs from the world’s leading computer makers. It is also now available in the cloud, starting with Google Cloud Platform customers.

“We have never before seen such rapid adoption of a datacenter processor,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. “Just 60 days after the T4’s launch, it’s now available in the cloud and is supported by a worldwide network of server makers. The T4 gives today’s public and private clouds the performance and efficiency needed for compute-intensive workloads at scale.”

The T4 accelerates diverse cloud workloads, including high performance computing, deep learning training and inference, machine learning, data analytics, and graphics. Based on the new NVIDIA Turing architecture, it features multi-precision Turing Tensor Cores and new RT Cores, which, when combined with accelerated containerized software stacks, deliver unprecedented performance at scale.

“Real-time visualization and online inference workloads need low latency for their end users. We are delighted to partner with NVIDIA to offer T4 GPU support for Google Cloud customers,” said Damion Heredia, senior director of Product Management at Google Cloud. “NVIDIA T4 GPUs for Google Cloud offer a highly scalable, cost-effective, low-latency platform for our ML and visualization customers. Google Cloud’s network capabilities together with the T4 offering enable customers to innovate in new ways, speeding up applications while reducing costs.”

See our complete coverage of SC18 in Dallas
