“Learn about extensions that enable efficient use of Partitioned Global Address Space (PGAS) Models like OpenSHMEM and UPC on supercomputing clusters with NVIDIA GPUs. PGAS models are gaining attention for providing shared memory abstractions that make it easy to develop applications with dynamic and irregular communication patterns. However, the existing UPC and OpenSHMEM standards do not allow communication calls to be made directly on GPU device memory. This talk discusses simple extensions to the OpenSHMEM and UPC models to address this issue.”
“We present results for a platform consisting of an NVM Express SSD, a CAPI accelerator card and a software stack running on a Power8 system. We show how the threading of the Power8 CPU can be used to move data from the SSD to the CAPI card at very high speeds and implement accelerator functions inside the CAPI card that can process the data at these speeds.”
“E4 Computer Engineering has introduced ARKA, the first server solution based on ARM 64 bit SoC dedicated to HPC. The compute node is boosted by discrete GPU NVIDIA cards K20 with 10Gb ethernet and FDR InfiniBand networks implemented by default. In this presentation, the hardware configuration of the compute node is described in detail. The unique capabilities of the ARM+GPU+IB combination are described, including many synthetic benchmarks and application tests with particular attention to molecular dynamics software.”
“ConnectX-4 EDR 100Gb/s with CAPI support tightly integrates with the POWER CPU at the local bus level and provides faster access between the POWER CPU and the network device. We will discuss the latest interconnect advancements that maximize application performance and scalability on OpenPOWER architecture, including enhanced flexible connectivity with the latest Mellanox ConnectX-3 Pro Programmable Network Adapter.”
“The Cray XC series DataWarp applications I/O accelerator technology delivers a balanced and cohesive system architecture from compute to storage. It allocates storage dynamically in either private (dedicated) or shared modes. Storage performance quality of service can be provided to individual applications, based on the user’s policies. While leveraging Cray’s proven domain expertise in storage, the DataWarp accelerator can be used as a global storage cache for parallel file systems (PFS) such as Lustre, General Parallel File System (GPFS) and PanFS.”
Network upgrades are enabling more data-intensive science in Canada, reports WestGrid, one of four regional organizations working in partnership with Compute Canada. In the past year, the regional organization has transferred more than 5 PB of research data across Cybera and CANARIE’s advanced networks in the past year, representing a 22% increase in network traffic from 2013-14.