GPU-powered HPC Workloads in the Cloud with AWS and NVIDIA

[SPONSORED GUEST ARTICLE] Across industries cloud-based high performance computing (HPC) is on the rise. Find out from AWS and NVIDIA how GPU-accelerated compute is helping organizations run more HPC workloads and AI/ML jobs faster, in a more energy-efficient way.

Reduce Costs while Accelerating Data-intensive HPC Workloads

Access virtually unlimited infrastructure with HPC optimized instances and quick interconnect speeds to run more complex (FEA) simulations faster. Reduce product development costs, improve product quality, and shorten time-to-market.

AWS Announces GA of EC2 Trn1 Instances for ML Model Training 

SEATTLE — Oct. 10, 2022 — Amazon Web Services today announced the general availability of Amazon Elastic Compute Cloud (Amazon EC2) Trn1 instances powered by AWS-designed Trainium chips. Trn1 instances are built for high-performance training of machine learning models in the cloud. AWS said the offering saves up to 50 percent cost-to-train savings over comparable […]

AWS Launches Storage-Optimized EC2 Instances (I4i) Powered by Intel Xeon Scalable (Ice Lake) Processors

April 27, 2022 — Amazon Web Services has launched new new I4i instances powered by the latest generation Intel Xeon Scalable (Ice Lake) Processors with an all-core turbo frequency of 3.5 GHz. The company said The instances offer up to 30 TB of NVMe storage using AWS Nitro SSD devices that are custom-built by AWS, and are […]

Weka Claims 6 Records on STAC-M3 WEKAFS File Systems on EC2

CAMPBELL, Calif. – June 8, 2021 – WekaIO (Weka), the data platform for artificial intelligence/machine learning (AI/ML), life sciences research, and high-performance computing (HPC), today announced record-breaking performance of its Weka File System (WekaFS) on Amazon Elastic Compute Cloud (Amazon EC2) according to the STAC-M3 Benchmark. An independent audit, conducted by Securities Technology Analysis Center (STAC), showed that […]

AWS Launches Nvidia GPU-Driven EC2 P4d Instances for AI, HPC

Amazon Web Services today announced the general availability of Amazon EC2 P4d Instances powered by Nvidia GPUs with EC2 UltraClusters capability delivering 3x faster performance, up to 60 percent lower cost, and 2.5x more GPU memory for machine learning training and HPC workloads compared to previous-generation P3 instances, according to AWS. The company said P4d […]

White Paper: Basement Supercomputer beats Amazon EC2 on Cost and Performance

The good folks at Basement Supercomputing have published a white paper comparing one of the company’s on-prem Limulus personal HPC appliance to Amazon EC2 Cloud instances. The economics are quite interesting. “In this careful study equivalent EC2 cluster instances were configured using current pricing (Spring 2019). A comparison of Basement Supercomputing Limulus appliance workstations with the Amazon EC2 (Elastic Compute Cloud) is presented. The capabilities of both approaches are discussed along with a detailed comparison of two Limulus appliance designs.”

Amazon and Libfabric: A case study in flexible HPC Infrastructure

Brian Barrett from Amazon gave this talk at the 2018 OpenFabrics Workshop. “As network performance becomes a larger bottleneck in application performance, AWS is investing in improving HPC network performance. Our initial investment focused on improving performance in open source MPI implementations, with positive results. Recently, however, we have pivoted to focusing on using libfabric to improve point to point performance.”

Video: Introducing the Nitro Hypervisor – the Evolution of Amazon EC2 Virtualization

In this video from AWS Reinvent, Anthony Liguori from Amazon presents: Nitro Hypervisor – the Evolution of Amazon EC2 Virtualization. “The new Nitro hypervisor for Amazon EC2, introduced with the launch of C5 instances, is a component that primarily provides CPU and memory isolation for C5 instances. VPC networking, and EBS storage resources are implemented by dedicated hardware components that are part of all current generation EC2 instance families. It is built on core Linux Kernel-based Virtual Machine (KVM) technology, but does not include general purpose operating system components.

AWS Adds FPGAs to its Public Cloud

Amazon Web Services chief evangelist Jeff Barr announced in a recent blog post that the company was adding Xilinx FPGAs to its Amazon Elastic Compute Cloud (Amazon EC2). The addition of this new hardware will allow users to create accelerated FPGA applications, but AWS will also let users sell these applications on the AWS Marketplace. “We are giving you the ability to design your own logic, simulate and verify it using cloud-based tools, and then get it to market in a matter of days,” said Barr.