As multi-socket, then multi-core systems have become the standard, the Message Passing Interface (MPI) has become one of the most popular programming models for applications that can run in parallel using many sockets and cores. Shared memory programming interfaces, such as OpenMP, have allowed developers to take advantage of systems that combine many individual servers and shared memory within the server itself. However, two different programming models have been used at the same time. The MPI 3.0 standard allows for a new MPI interprocess shared memory extension (MPI SHM).
In this week’s industry Perspective, Katie Garrison of One Stop Systems explains how GPUltima allows HPC professionals to create a highly dense compute platform that delivers a petaflop of performance at greatly reduced cost and space requirements.compute power needed to quickly process the amount of data generated in intensive applications.
Matrix multiplies can be decomposed into tiles and executed very fast on the latest generations of coprocessors. Intel has developed the hStreams library that supports task concurrency on heterogeneous platforms. The concurrency may be across nodes (Xeon, KNC, KNL-SB, KNL-LB); within a node for small matrix operations; and in the overlapping of computation and communication, particularly for tiled solutions. It relieves the user of complexity in dealing with thread affinitization, offloading, memory types, and memory affinitization.
Although liquid cooling is considered by many to be the future for data centers, the fact remains that there are some who do not yet need to make a full transformation to liquid cooling. Others are restricted until the next budget cycle. Whatever the reason, new technologies like Internal Loop are more affordable than liquid cooling and can replaces less efficient air coolers. This enables HPC data centers to still utilize the highest performing CPUs and GPUs.
An interesting use of HPC technologies is in the area of understanding the propagation of radio frequency energy in an outdoor environment. “Applications of this type need to be completed in seconds to minutes to be useful. Since the tracing of each ray is independent of another ray, this type of application can be distributed easily among the many cores of the Intel Xeon Phi coprocessor.”
Lustre was originally developed as the fastest scratch file system for HPC workloads that supercomputer centers could get, but has over the years matured to be an enterprise-class parallel file system supporting mission-critical workloads. Unfortunately, in spite of Lustre having become extremely attractive to enterprises and adopted by IT departments, some naysayers continue toclaim that Lustre is still just a scratch file system. We in the Lustre community see quite a different picture.
“Applications in diverse industries such as the hospitality and retail industry, social networks and surveillance can benefit from real time image recognition. Parallelism at the system level can be divided into two main areas. First, at the database level and second at the image recognition level. The compute load per thread on the host system can just be calculated as the total number of images in the database divided by the number of threads. The image matching algorithms can then be parallelized on the coprocessor.”
“Modal is a cosmological statistical analysis package that can be optimized to take advantage of a high number of cores. The inner product computations with Modal can be run on the Intel Xeon Phi coprocessor. As a base, the entire simulation took about 6 hours on the Intel Xeon processor. Since the inner calculations are independent from each other, this lends to using the Intel Xeon Phi coprocessor.”
“Just as representative benchmarks like HPCG are set to replace Linpack, so a focus on software is taking over. From industry analysts to users at SC15 we heard that software is the number one challenge and the number one opportunity to have world-class impact.”
Altair has announced that it will provide an open source licensing option of PBS Professional® (PBS Pro). PBS Pro will become available under two different licensing options for commercial installations and as an Open Source Initiative compliant version. Altair will work closely with Intel and the Linux Foundation’s OpenHPC Collaborative Project to integrate the open source version of PBS Pro.