OpenACC Toolkit is now Free for Academia

Over at the Nvidia Blog, Paresh Kharya writes that the company is releasing its OpenACC Toolkit as a free download for academia.

Video: RDMA Container Support

In this video from the 2015 OFS Workshop, Liran Liss from Mellanox presents: RDMA Container Support.

Video: Intel Vector Advisor Unlocks Code Performance

In this video, Rick Leinecker from Slashdot Media describes the Vectorization Advisor, one of the new additions to the Intel Parallel Studio XE suite. “Vectorization Advisor is an analysis tool that lets you identify if loops utilize modern SIMD instructions or not, what prevents vectorization, what is performance efficiency and how to increase it. Vectorization Advisor shows compiler optimization reports in user-friendly way, and extends them with multiple other metrics, like loop trip counts, CPU time, memory access patterns and recommendations for optimization.”

Interview: How Univa Short Jobs Brings Low Latency to Financial Services

Gary Tyreman, CEO, Univa

With the launch of the Univa Short Jobs add-on for Univa Grid Engine, the company offers “the world’s most efficient processing and lowest latency available for important tasks like real-time trading, transactions, and other critical applications.” To learn more, we caught up with Univa President & CEO Gary Tyreman.

Computing With MPI in Heterogeneous Environments

Designating the appropriate provider for large MPI applications is critical to taking advantage of all of the compute power available. “A modern HPC system with multiple host cpus and multiple coprocessors such as the Intel Xeon Phi coprocessor housed in numerous racks can be optimized for maximum application performance with intelligent thread placement.”

Hero Performance is about the Applications

David Lecomber, CEO, Allinea

During last month’s PRACE Days in Dublin – where I enjoyed talks on improvements in codes and methods in areas as diverse as CFD, RTM in geophysics, and in genomics – I saw once again that “hero” performance improvements happen and happen regularly.

Video: Three Ways to Debug Parallel CUDA Applications

“This talk will introduce these three debugging techniques and provide some suggestions on selecting the optimal approach for a variety of debugging scenarios such as hangs, numerical errors, and crashes. Specific examples will be given using the TotalView debugger but the concepts covered may apply to other debugging tools such as GDB and the NVIDIA NSIGHT debugger.”

Video: Beta Review of Intel Parallel Studio XE 2016

In this video, Rick Leinecker from Slashdot Media reviews the beta version of Intel Parallel Studio XE 2016. Leinecker describes several of the notable features and updates, including OpenMP enhancements, vastly improved computer vision and image processing, and the Data Analytics Acceleration Library.

ArrayFire Updates GPU Software Library

Today ArrayFire announced the release of Version 3.0 of their high-speed software library for GPU computing. The new version features major changes to ArrayFire’s visualization library, a new CPU backend, and dense linear algebra for OpenCL devices. It also includes improvements across the board for ArrayFire’s OpenCL backend.

Video: Porting Scientific Apps to GPUs with OpenACC

In this video from the GPU Technology Conference, Saber Feki from KAUST and Ahmed Al-Jarro from Fujitsu Labs in Europe present: Experiences in Porting Scientific Applications to GPUs Using OpenACC.