Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


OpenACC Takes Off at ISC17

Today the OpenACC standards group announced plans to showcase new advancements and increasing momentum for their programming model at ISC 2017 in Frankfurt. “OpenACC is a user-driven directive-based performance-portable parallel programming model designed for scientists and engineers interested in porting their codes to a wide-variety of heterogeneous HPC hardware platforms and architectures with significantly less programming effort than required with a low-level or explicit models.”

Allinea to Showcase V7.1 Performance Tools at ISC 2017

Today Allinea software, now part of ARM, announced plans to preview the latest update to its powerful tool suite for developing and optimizing high performance and scientific applications at ISC17 in Frankfurt. “Eliminating performance loss across systems and minimizing the communication overhead often make a critical difference in improving application run times in HPC. We believe that our extended cross-platform support will enable users to achieve unprecedented results by running their systems more efficiently on each of the major platforms and even across platforms,” said Mark O’Connor, director of product management, server and HPC tools, ARM.

Multicore Performance Challenges for Game Developers

Game developers face a unique challenge – how to make their graphics-heavy applications perform well across a very wide spectrum of hardware devices, not just high-end systems. So while an early version of a game might have been developed on some high-end system with 10 teraflops of CPU potential in a discrete graphics card, how do you scale it down to smaller consumer devices where optimization options are more limited?

Jump Start your Immersive Video Experiences

“With a wide range of Intel Xeon processors available, a truly immersive video experience can be delivered to a diverse set of end users who enjoy watching a concert or sporting event from wherever they may be located. While not an immersive VR experience, immersive video where the user has control over the viewing angle is a very important technology that can deliver exciting content to many consumers.”

The OpenMP API Celebrates 20 Years of Success

OpenMP is a good example of how hardware and software vendors, researchers, and academia, volunteering to work together, can successfully design a standard that benefits the entire developer community. Today, most software vendors track OpenMP advances closely and have implemented the latest API features in their compilers and tools. With OpenMP, application portability is assured across the latest multicore systems, including Intel Xeon Phi processors.

Liqid Delivers Composable Infrastructure Solution for Dynamic GPU Resource Allocation

Liqid Inc. has fully integrated GPU support into the Liqid Composable Infrastructure (CI) Platform. “Liqid’s CI Platform is the first solution to support GPUs as a dynamic, assignable, bare-metal resource. With the addition of graphics processing, the Liqid CI Platform delivers the industry’s most fully realized approach to composable infrastructure architecture.”

C++ Parallel STL Introduced in Intel Parallel Studio XE 2018 Beta

Parallel STL now makes it possible to transform existing sequential C++ code to take advantage of the threading and vectorization capabilities of modern hardware architectures. It does this by extending the C++ Standard Template Library with an execution policy argument that specifies the degree of threading and vectorization for each algorithm used.

Boosting Manycore Code Optimization Efforts with Roofline Technology

A software toolkit developed at Berkeley Lab to better understand supercomputer performance is now being used to boost application performance for researchers running codes at NERSC and other supercomputing facilities. “Since its initial development, what is now known as the Empirical Roofline Toolkit (ERT) has benefitted from contributions by several Berkeley Lab staff. Along the way, HPC users who write scientific applications for manycore systems have been able to apply the toolkit to their applications and see how changing parameters of their code can improve performance.”

Greg Kurtzer of LBNL Launches SingularityWare, LLC

Over at the Singularity Blog, Greg Kurtzer writes that he has created a new organization, SingularityWare, LLC. In partnership with RStor, the new company will be dedicated to further developing Singularity, supporting the associated open source community and growing the project. “In addition to continuing my leadership of Singularity (and the new LLC), I will be maintaining my association with Lawrence Berkeley National Laboratory, as a scientific advisor as well as continuing other efforts I am associated with (e.g. Warewulf and OpenHPC).”

Intel Advisor Roofline Analysis Finds New Opportunities for Optimizing Application Performance

Intel Advisor, an integral part of Intel Parallel Studio XE 2017, can help identify portions of code that could be good candidates for parallelization (both vectorization and threading). It can also help determine when it might not be appropriate to parallelize a section of code, depending on the platform, processor, and configuration it’s running on. Intel Advisor Roofline Analysis reveals the gap between an application’s performance and its expected performance.