MacGyver is a character famous for getting out of tough situations by improvising a solution from whatever bits and pieces he found lying around. That’s no way to manage a critical part of your organization’s infrastructure.
“This presentation will provide an overview of the Nvidia Tesla Deployment Kit (TDK) from a user and a system administrator point of view. TDK contains the Nvidia Management Library (NVML) and nvidia-healthmon, a tool for detecting and troubleshooting known GPU issues in a cluster environment. Usage models within a cluster environment will be presented along with a discussion on how existing resource management tools can be extended to improve allocation and accounting of GPU resources.”
“OpenACC is gaining momentum and adoption,” said Duncan Poole, President of the OpenACC Standards Group. “Developers benefit because using OpenACC directives makes parallel programming more productive and collaboration easier. Large, legacy codes are easier to maintain and accelerated code is more portable across HPC systems.”