MVAPICH at Petascale: Experiences in Production on Stampede

Print Friendly, PDF & Email

Dan Stanzione, Executive Director, TACC

Dan Stanzione, Executive Director, TACC

In this video from the 2nd Annual MVAPICH User Group (MUG) Meeting, Dan Stanzione from TACC presents: MVAPICH at Petascale: Experiences in Production on the Stampede System.

The Stampede system began production operations in January 2013. The system was one of the largest ever deployments of MVAPICH, with a 6,400 node FDR Infiniband fabric connecting more than 2PF of Intel Xeon processors. The system also was the first large scale installation of the Intel many core Xeon Phi Co-Processors, which also used MVAPICH for communications. This talk will discuss the experiences over the first 1.5 years of production with MVAPICH and Stampede. The talk will cover some science results from the system, but will focus on scaling results from the base Xeon cluster and the IB network, experiences with both native and symmetric mode MPI from the Xeon Phi over the PCI bus, and will show dramatics improvements made in performance during the production life of the system due to improvements in MVAPICH as a result of testing with Stampede.

Download the Slides (PDF)