Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Cowboy Supercomputer Powers Research at Oklahoma State

In this video, Dana Brunson from Oklahoma State describes the mission of the Oklahoma High Performance Computing Center. Formed in 2007, the HPCC facilitates computational and data-intensive research across a wide variety of disciplines by providing students, faculty and staff with cyberinfrastructure resources, cloud services, education and training, bioinformatics assistance, proposal support and collaboration.

Managing Node Configuration with 1000s of Nodes

Ira Weiny from Intel presented this talk at the OpenFabrics Workshop. “Individual node configuration when managing 1000s or 10s of thousands of nodes in a cluster can be a daunting challenge. Two key daemons are now part of the rdma-core package which aid the management of individual nodes in a large fabric: IBACM and rdma-ndd.”

Dr. Robert Voigt on Educating Computational Scientists

In this video from KAUST Live, Dr. Robert Voigt discusses his recent keynote at the HPC Saudi Conference on the topic of Educating Computational Scientists. “This talk will provide a historical perspective on the challenges of educating computational scientists based on my personal involvement over a number of years. Three decidedly different activities will be drawn on to indicate how one can successfully approach the challenge.”

Video: InfiniBand Virtualization

“Infiniband Virtualization allows a single Channel Adapter to present multiple transport endpoints that share the same physical port. To software, these endpoints are exposed as independent Virtual HCAs (VHCAs), and thus may be assigned to different software entities, such as VMs. VHCAs are visible to Subnet Management, and are managed just like physical HCAs. We will cover the Virtualization model, management, addressing modes, and discuss deployment considerations.”

Building Efficient HPC Clouds with MCAPICH2 and RDMA-Hadoop over SR-IOV IB Clusters

Xiaoyi Lu from Ohio State University presented this talk at the Open Fabrics Workshop. “Single Root I/O Virtualization (SR-IOV) technology has been steadily gaining momentum for high performance interconnects such as InfiniBand. SR-IOV can deliver near native performance but lacks locality-aware communication support. This talk presents an efficient approach to building HPC clouds based on MVAPICH2 and RDMA-Hadoop with SR-IOV.”

GEN-Z: An Overview and Use Cases

Greg Casey from Dell EMC presented this talk at the OpenFabrics Workshop. “This session will focus on the new Gen-Z memory-semantic fabric. The speaker will show the audience why Gen-Z is needed, how Gen-Z operates, what is expected in first products that employ Gen-Z, and encourage participation in finalizing the Gen-Z specifications. Gen-Z will be connecting components inside of servers as well as connecting servers with pools of memory, storage, and acceleration devices through a switch environment.”

Experiences with NVMe over Fabrics

“Using RDMA, NVMe over Fabrics (NVMe-oF) provides the high BW and low-latency characteristics of NVMe to remote devices. Moreover, these performance traits are delivered with negligible CPU overhead as the bulk of the data transfer is conducted by RDMA. In this session, we present an overview of NVMe-oF and its implementation in Linux. We point out the main design choices and evaluate NVMe-oF performance for both Infiniband and RoCE fabrics.”

Video: Thomas Schulthess on how HPC Propels the Global Enterprise of Science

In this video from the HPC Saudi Conference, Dr. Thomas Schulthess from the Swiss National Supercomputing Center discusses how CSCS approaches High Performance Computing. According to Schulthess, supporting legacy software is the biggest challenge for moving HPC forward. “Earlier this month, the European PRACE initiative went into Phase 2, with Switzerland becoming a new Hosting Member. As a Hosting Member, Switzerland is now making its Piz Daint supercomputer at CSCS available for cutting-edge PRACE research. The other Hosting Members are Spain, Italy, Germany and France.”

Video: Omni-Path Status, Upstreaming, and Ongoing Work

Todd Rimmer from Intel presented this talk at the OpenFabrics Workshop. “Intel Omni-Path was first released in early 2016. Omni-Path host and management software is all open sourced. This session will provide an overview of Omni-Path including some of the technical capabilities and performance results as well as some recent industry results.”

Accelerating Apache Spark with RDMA

Yuval Degani from Mellanox presented this talk at the OpenFabrics Workshop. “In this talk, we present a Java-based, RDMA network layer for Apache Spark. The implementation optimized both the RPC and the Shuffle mechanisms for RDMA. Initial benchmarking shows up to 25% improvement for Spark Applications.”