Cray to Build El Capitan Exascale Supercomputer at LLNL

Today the Department of Energy announced that Cray will build the NNSA’s first exascale supercomputer, “El Capitan.” To be hosted at LLNL, El Capitan will have a peak performance of more than 1.5 exaflops and an anticipated delivery in late 2022. The total contract award is valued at $600 million.

The Department of Energy is the world leader in supercomputing and El Capitan is a critical addition to our next-generation systems,” said U.S. Energy Secretary Rick Perry. “El Capitan’s advanced capabilities for modeling, simulation and artificial intelligence will help push America’s competitive edge in energy and national security, allow us to ask tougher questions, solve greater challenges and develop better solutions for generations to come.”

Featuring advanced capabilities for modeling, simulation and artificial intelligence (AI), based on Cray’s new Shasta architecture, El Capitan is projected to run national nuclear security applications at more than 50 times the speed of LLNL’s Sequoia system. Depending on the application, El Capitan will run roughly 10 times faster on average than LLNL’s Sierra system, currently the world’s second most powerful supercomputer at 125 petaflops of peak performance. Projected to be at least four times more energy efficient than Sierra, El Capitan is expected to go into production by late 2023, servicing the needs of NNSA’s Tri-Laboratory community: Lawrence Livermore National Laboratory, Los Alamos National Laboratory and Sandia National Laboratories.

El Capitan will be DOE’s third exascale-class supercomputer, following Argonne National Laboratory’s “Aurora” and Oak Ridge National Laboratory’s “Frontier” system. All three DOE exascale supercomputers will be built by Cray utilizing their Shasta architecture, Slingshot interconnect and new software platform.

At press time, DOE did not disclose the chip and accelerator component suppliers for El Capitan. The Shasta architecture is rather agnostic in that it can support Intel processors/accelerators (as with Aurora) and AMD processors/accelerators (as with Frontier). Cray and HPE both have supercomputing platforms based on Arm processors, so that would seem to be on the table as well.

Developed as part of the second phase of the Collaboration of Oak Ridge, Argonne and Livermore (CORAL-2) procurement, El Capitan will serve the mission needs of NNSA. It will perform essential functions for the Stockpile Stewardship Program, which supports U.S. national security missions through leading-edge scientific, engineering and technical tools and expertise, ensuring the safety, security and effectiveness of the nation’s nuclear stockpile in the absence of underground testing. El Capitan will be used to make critical assessments necessary for addressing evolving threats to national security, and other purposes such as nonproliferation and nuclear counterterrorism.

NNSA is modernizing the Nuclear Security Enterprise to face 21st-century threats,” said Lisa E. Gordon-Hagerty, DOE under secretary for Nuclear Security and NNSA administrator. “El Capitan will allow us to be more responsive, innovative and forward-thinking when it comes to maintaining a nuclear deterrent that is second to none in a rapidly evolving threat environment.”

Exascale performance will be delivered by a heterogeneous Central Processing Unit (CPU)/Graphical Processing Unit (GPU) architecture. This architecture will allow researchers to run exploratory 3D simulations at resolutions that are currently unobtainable and ensembles of 3D calculations at resolutions that are difficult, time-consuming or even impossible using today’s state-of-the art supercomputers. 3D simulations are becoming essential to meet the unprecedented demands of the NNSA Life Extension Programs (LEPs) and address nuclear weapon aging issues for which researchers have no nuclear test data.

El Capitan is expected to provide invaluable simulation capabilities for the LEPs by providing scientists and weapon designers the computational tools to explore the use of new materials and components, improve robustness and safety, reduce maintenance costs and reduce manufacturing and production costs. Exascale computing also will permit faster, more detailed 3D modeling and simulation. These capabilities also will benefit areas of basic science beyond nuclear security, requiring high-resolution multi-physics simulations, such as cancer research, optimizing design for additive manufacturing, climate, seismology and astrophysics.

We are proud to partner with Cray in the coming years to usher in the era of exascale computing at LLNL, beginning the next chapter in the long, storied history we have at this Laboratory in leading-edge supercomputing,” said Lab Director Bill Goldstein. “El Capitan will allow our scientists and engineers to get answers to critical questions about the nuclear stockpile faster and more accurately than ever before, improving our efficiency and productivity, and enhancing our ability to reach our mission and national security goals.”

El Capitan will be built on Cray’s Shasta supercomputing architecture and will be comprised of Shasta compute nodes and a future generation of ClusterStor storage. This unique architecture will be connected with Cray’s new Slingshot high-speed interconnect. The Shasta hardware and software architecture can accommodate a variety of processors and accelerators, making it possible for Cray and LLNL to work together in the coming months to finalize the decision on which processor and GPU components will be used at the node level to maximize performance for the enormous projected workloads. The platform also will utilize Cray’s new system and analytics software stack, which will deliver the scalability and flexibility needed for exascale computing. It also will enable the converged use of modeling, simulation and AI in support of the Lab’s research missions.

We are honored to be a part of this historic moment to deliver the next U.S. exascale supercomputing system to the DOE, NNSA and LLNL in support of their incredibly important mission,” said Pete Ungaro, president and CEO of Cray. “We couldn’t be more excited that Cray’s Shasta systems, software and Slingshot interconnect will be the foundation for the first three U.S. exascale systems. El Capitan will incorporate foundational new software technologies from Cray that are critical for the exascale era where digital transformation and the convergence of modeling simulation, analytics and AI are driving new, data-intensive workloads at extreme scale.”

As part of the collaborative effort with the DOE Exascale Computing Initiative, LLNL scientists and principal investigators already are working on application development and software technologies that will be needed to ensure the necessary pieces are in place for El Capitan to have a fully functional exascale ecosystem from Day One. A center of excellence will be established shortly in collaboration with Cray to port and optimize existing codes to run on El Capitan with a goal of enabling programmatic work to initiate immediately after the machine is accepted into production.

Sign up for our insideHPC Newsletter