Experiences with NVMe over Fabrics

“Using RDMA, NVMe over Fabrics (NVMe-oF) provides the high BW and low-latency characteristics of NVMe to remote devices. Moreover, these performance traits are delivered with negligible CPU overhead as the bulk of the data transfer is conducted by RDMA. In this session, we present an overview of NVMe-oF and its implementation in Linux. We point out the main design choices and evaluate NVMe-oF performance for both Infiniband and RoCE fabrics.”

Video: Omni-Path Status, Upstreaming, and Ongoing Work

Todd Rimmer from Intel presented this talk at the OpenFabrics Workshop. “Intel Omni-Path was first released in early 2016. Omni-Path host and management software is all open sourced. This session will provide an overview of Omni-Path including some of the technical capabilities and performance results as well as some recent industry results.”

Accelerating Apache Spark with RDMA

Yuval Degani from Mellanox presented this talk at the OpenFabrics Workshop. “In this talk, we present a Java-based, RDMA network layer for Apache Spark. The implementation optimized both the RPC and the Shuffle mechanisms for RDMA. Initial benchmarking shows up to 25% improvement for Spark Applications.”

Video: State of the OpenFabrics Alliance

In this video from the OpenFabrics Workshop, Susan Coulter from LANL presents: State of the OpenFabrics Alliance. “The OpenFabrics Alliance (OFA) is an open source-based organization that develops, tests, licenses, supports and distributes OpenFabrics Software (OFS). The Alliance’s mission is to develop and promote software that enables maximum application efficiency by delivering wire-speed messaging, ultra-low latencies and maximum bandwidth directly to applications with minimal CPU overhead.”

Accelerating Hadoop, Spark, and Memcached with HPC Technologies

“This talk will present RDMA-based designs using OpenFabrics Verbs and heterogeneous storage architectures to accelerate multiple components of Hadoop (HDFS, MapReduce, RPC, and HBase), Spark and Memcached. An overview of the associated RDMA-enabled software libraries (being designed and publicly distributed as a part of the HiBD project for Apache Hadoop.”

Recent Topics in the IBTA… and a Look Ahead

Bill Magro from IBTA gave this talk at the OpenFabrics Workshop. “This talk discusses some recent activities in the InfiniBand Trade Association including recent specification updates. It also provides a glimpse into the future for the IBTA.” Bill Magro is an Intel Fellow and Intel’s Chief Technologist for HPC software. In this role, he serves as the technical lead and strategist for Intel’s high-performance computing software and provides HPC software requirements for Intel product roadmaps.”

Exascale Computing Project – Driving a HUGE Change in a Changing World

“In this keynote, Al Geist will discuss the need for future Department of Energy supercomputers to solve emerging data science and machine learning problems in addition to running traditional modeling and simulation applications. The ECP goals are intended to enable the delivery of capable exascale computers in 2022 and one early exascale system in 2021, which will foster a rich exascale ecosystem and work toward ensuring continued U.S. leadership in HPC. He will also share how the ECP plans to achieve these goals and the potential positive impacts for OFA.”

OFA Workshop Posts Session Abstracts for Austin Meeting in March

Today the OpenFabrics Alliance (OFA) published the session abstracts for its 13th Annual OFA Workshop. Sponsored by Intel, the workshop takes place March 27-31 in Austin, Texas. “The workshop will include more than 50 sessions covering a variety of critical networking topics delivered by industry experts from around the world. Additionally, the OFA has announced that Al Geist of Oak Ridge National Laboratory (ORNL) will deliver a workshop keynote address on the impact of the Exascale Computing Project. The workshop program is designed to educate attendees and encourage lively exchanges among OFA members, developers, and users who share a vested interest in high performance networks.”

Video: Intel Omni-Path Fabric Management and Tool Features

In this video from the 2016 OpenFabrics Workshop, James Wright from Intel presents: Intel Omni-Path Fabric Management and Tools Features. “The Intel Omni-Path Fabric includes a number of hardware and software features to make fabric monitoring, management and diagnosis easier. This session will provide a brief overview of the management software architecture and key features.”

Video: UPC++ Parallel Programming Extension

In this video from the 2016 OpenFabrics Workshop, Zili Zheng from LBNL presents: UPC++. “UPC++ is a parallel programming extension for developing C++ applications with the partitioned global address space (PGAS) model. UPC++ has demonstrated excellent performance and scalability with applications and benchmarks such as global seismic tomography, Hartree-Fock, BoxLib AMR framework and more. In this talk, we will give an overview of UPC++ and discuss the opportunities and challenges of leveraging modern network features.”