ARM goes Big: HPE Builds Petaflop Supercomputer for Sandia

Print Friendly, PDF & Email

Today HPE announced its collaboration with Sandia National Laboratories and the U.S. Department of Energy (DOE) to deliver the world’s largest Arm supercomputer. As part of the Vanguard program, Astra, the new Arm-based system, will be used by the National Nuclear Security Administration (NNSA) to run advanced modeling and simulation workloads for addressing areas such as national security, energy and science.

By introducing Arm processors with the HPE Apollo 70, a purpose-built HPC architecture, we are bringing powerful elements, like optimal memory performance and greater density, to supercomputers that existing technologies in the market cannot match,” said Mike Vildibill, vice president, Advanced Technologies Group, HPE. “Sandia National Laboratories has been an active partner in leveraging our Arm-based platform since its early design, and featuring it in the deployment of the world’s largest Arm-based supercomputer, is a strategic investment for the DOE and the industry as a whole as we race toward achieving exascale computing.”

In today’s data-intensive environment, there is an insatiable demand for higher compute performance as organizations continue to conduct research-intensive tasks that require processing and analyzing large data sets to address challenges across medicine, climate change, space, and oil and gas exploration. To accelerate this research and discovery process, there is a global race to achieve the next leap in supercomputing with an exascale-class system that is 10 times greater than today’s systems. As a strong contender in this race, the U.S. Department of Energy has moved up its timeline to deliver the first exascale system by 2021 and by doing so, is investing in diverse architectures to support necessary performance, memory and scalable capabilities to drive engineering to the next great frontier in computing.

By introducing new processors like Arm to the HPC ecosystem, which has been historically dominated by x86-based technologies, HPE is building a diverse network to offer more competitive options to power next-generation supercomputers while accelerating the path to exascale. HPE is also delivering memory-centric designs to support rapidly growing data-intensive HPC workloads while enabling greater density with more performance-packed servers by bringing robust, Arm-based HPC technologies to power the Astra supercomputer and future systems.

With Astra, a major stepping stone in HPE’s path to exascale, HPE is delivering over 2.3 theoretical peak petaflops of performance4, 33% better memory performance than traditional market offerings5, and greater system density. The NNSA, an agency within the DOE that is responsible for the management and security of the nation’s nuclear weapons, nuclear nonproliferation, and naval reactor programs, will use Astra to advance modeling and simulation tools to improve analysis on data-intensive science experiments.

The National Nuclear Security Administration is committed to the advancement of high performance computing in support of the nation’s stockpile stewardship,” said Mark Anderson, director, Office of Advanced Simulation and Computing and Institutional Research and Development, National Nuclear Security Administration. “By investigating emerging advanced architectures, like Arm, as part of the Vanguard program, we hope to gain new approaches to predictive analysis and simulation modeling that addresses complex, data-driven engineering and science simulations.”

Astra will be deployed at Sandia National Laboratories and will run on the HPE Apollo 70. This purpose-built HPC platform is based on the Cavium ThunderX2 Arm processor. Astra is comprised of over 145,000 cores in 2,592 dual-processor servers and offers greater density with four compute nodes in a 2U form factor.

We are committed to fielding the most advanced technologies as part of the Vanguard program,” said James Laros, Vanguard program lead, Sandia National Laboratories. “We are collaborating with HPE to advance the Arm ecosystem and prove the viability of this architecture to support our national security mission.”

Additional HPE features that deliver optimal performance include:

  • HPE Performance Software – Message Passing Interface (MPI): A high-performance MPI library complemented with profiling and runtime optimization tools designed to boost performance of existing MPI applications without the need to recompile existing codes. HPE’s implementation of
  • MPI is highly optimized, enabling faster performance and scalability when powering HPC applications.
  • HPE MCS-300 Cooling Unit: A power-efficient liquid cooling system that supports high-density systems without compromising on energy consumption. Using a liquid-cooled infrastructure enables more density, improves operational environment for HPC systems, and cost savings compared to traditional air-cooled solutions.
  • HPE Performance Cluster Manager: HPE’s first integrated system management software for HPC clusters that enables fast system setup and lowers total cost of ownership, increasing productivity and providing a better return on hardware investments.
  • HPE Apollo 4520: An optimized storage solution that offers a high performing, all-flash Lustre filesystem for extreme bandwidth and reduced latency on data access. The solution increases application performance, allowing more computation to take place on the system.

For additional insight into HPE’s HPC offerings and industry collaborations, please visit HPE’s High Performance Computing Solutions.

Stop by HPE booth #E1030 to view a demo on the HPE Apollo 70 platform and learn how the Arm-based system is powering next-generation supercomputers at ISC 2018 in Frankfurt, Germany.

Sign up for our insideHPC Newsletter

Comments

  1. Brent Gorda says

    This is a significant step forward for HPC both in terms of performance and efficiency. Having just gotten started, the ThunderX2 from Cavium has much more memory bandwidth than Xeon offerings. As a result (per recent CUG papers), performance exceeds Xeon for many applications. The ARM design is also more efficient, which definitely scratches an itch for the HPC community. After a long drought, it appears we have options for the cpu again. This looks to be a real contender for future HPC systems.