In this guest article, our friends at Intel discuss how CPUs can prove better than GPUs for some important deep learning workloads. Here's why, and keep your GPUs handy! Heterogeneous computing ushers in a world where we must consider permutations of algorithms and devices to find the best platform solution. No single device will win all the time, so we need to constantly reassess our choices and assumptions.
NVIDIA TensorRT 6 Breaks the 10-Millisecond Barrier for BERT-Large
Today, NVIDIA released TensorRT 6, which includes new capabilities that dramatically accelerate conversational AI applications, speech recognition, 3D image segmentation for medical applications, and image-based applications in industrial automation. TensorRT is a high-performance deep learning inference optimizer and runtime that delivers low-latency, high-throughput inference for AI applications. According to NVIDIA: “With today’s release, TensorRT continues to expand its set of optimized layers, provides highly requested capabilities for conversational AI applications, and delivers tighter integrations with frameworks to provide an easy path to deploy your applications on NVIDIA GPUs. In TensorRT 6, we’re also releasing new optimizations that deliver inference for BERT-Large in only 5.8 ms on T4 GPUs, making it practical for enterprises to deploy this model in production for the first time.”
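To make the workflow concrete, here is a minimal sketch of building an optimized inference engine with the TensorRT 6-era Python API from an ONNX export of a model. The file name, workspace size, and FP16 setting below are illustrative assumptions, not values taken from NVIDIA's announcement:

```python
import tensorrt as trt

# Sketch: build a TensorRT engine from an ONNX model.
# "bert_large.onnx", the workspace size, and FP16 mode are assumptions
# for illustration, not details from NVIDIA's release.
TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_path="bert_large.onnx"):
    builder = trt.Builder(TRT_LOGGER)
    # The ONNX parser requires an explicit-batch network definition.
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            raise RuntimeError("Failed to parse the ONNX model")

    builder.max_workspace_size = 1 << 30  # 1 GiB scratch space for optimization tactics
    builder.fp16_mode = True              # use FP16 kernels where supported (e.g. on T4)

    # TensorRT performs layer fusion, precision selection, and kernel
    # auto-tuning here, then returns a deployable engine.
    return builder.build_cuda_engine(network)
```

The optimization step runs offline; the resulting engine can be serialized with `engine.serialize()` and loaded by the TensorRT runtime at deployment time.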