Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:

New, Open DPC++ Extensions Complement SYCL and C++

In this guest article, our friends at Intel discuss how accelerated computing has diversified over the past several years given advances in CPU, GPU, FPGA, and AI technologies. This innovation drives the need for an open and cross-platform language that allows developers to realize the potential of new hardware, minimizes development cost and complexity, and maximizes reuse of their software investments.

Video: Preparing to program Aurora at Exascale – Early experiences and future directions

Hal Finkel from Argonne gave this talk at IWOCL / SYCLcon 2020. “Argonne National Laboratory’s Leadership Computing Facility will be home to Aurora, our first exascale supercomputer. This presentation will summarize the experiences of our team as we prepare for Aurora, exploring how to port applications to Aurora’s architecture and programming models, and distilling the challenges and best practices we’ve developed to date.”

Latest Release of Intel Parallel Studio XE Delivers New Features to Boost HPC and AI Performance

Intel Parallel Studio XE is a complete software development suite that includes highly optimized compilers and math and data analytics libraries, along with comprehensive tools for performance analysis, application debugging, and parallel processing. It’s available as a download for Windows, Linux, and MacOS. “With this release, the focus is on making it easier for HPC and AI developers to deliver fast and reliable parallel code for the most demanding applications.”

An Alternative to OpenMP and an On-Ramp to Future C++ Standards

In this edition of Let’s Talk Exascale, Christian Trott of Sandia National Laboratories shares insights about Kokkos, a programming model for numerous Exascale Computing Project applications. “Kokkos is a programming model being developed to deliver a widely usable alternative to programming in OpenMP. It is expected to be easier to use and provide a higher degree of performance portability, while integrating better into C++ codes.”

Video: The Kokkos C++ Performance Portability EcoSystem for Exascale

Christian Trott from Sandia gave this talk at the GPU Technology Conference. “The Kokkos C++ Performance Portability EcoSystem is a production-level solution for writing modern C++ applications in a hardware-agnostic way. We’ll provide success stories for Kokkos adoption in large production applications on the leading supercomputing platforms in the U.S. We’ll focus particularly on early results from two of the world’s most powerful supercomputers, Summit and Sierra, both powered by NVIDIA Tesla V100 GPUs.”

Expressing Parallelism in C++ with Threading Building Blocks

Parallelism helps applications make the best use of processors on single or multiple devices. However, parallelism implementation itself can prove a challenging task. In this video, Mike Voss, principal engineer with the Core and Visual Computing Group at Intel discusses the benefits of Intel® Threading Building Blocks (Intel® TBB), a C++ library, and how it can simplify the work of adding parallelism without the need to probe into threading details.

Codeplay Releases First Fully-Conformant SYCL 1.2.1 Solution for C++

SYCL is an open standard developed by the Khronos Group that enables developers to write code for heterogeneous systems using standard C++. Developers are looking at how they can accelerate their applications without having to write optimized processor specific code. SYCL is the industry standard for C++ acceleration, giving developers a platform to write high-performance code in standard C++, unlocking the performance of accelerators and specialized processors from companies such as AMD, Intel, Renesas, and Arm.

Unlocking the Power of Parallel Coding to Access Better Performance in Multi-Core Environments

A number of different frameworks and standards can be employed for parallel coding. The choice of the most suitable depends on the purpose of the application, its overall requirements and the target execution environment. Selecting the right framework is imperative to obtaining the best possible performance increase. The choice of framework is based on the available memory, overheads, controls and support.

Intel Parallel Studio XE 2018 Released

Intel has announced the release of Intel® Parallel Studio XE 2018, with updated compilers and developer tools. It is now available for downloading on a 30-day trial basis. ” This week’s formal release of the fully supported product is notable with new features that further enhance the toolset for accelerating HPC applications.”

C++ Parallel STL Introduced in Intel Parallel Studio XE 2018 Beta

Parallel STL now makes it possible to transform existing sequential C++ code to take advantage of the threading and vectorization capabilities of modern hardware architectures. It does this by extending the C++ Standard Template Library with an execution policy argument that specifies the degree of threading and vectorization for each algorithm used.