Sign up for our newsletter and get the latest HPC news and analysis.

Video: CUDA 6 and Beyond

Mark Harris

In this video, Nvidia’s Mark Harris, provides a detailed look at the top new features of CUDA 6, including a deep-dive review of Unified Memory, which makes GPU programming easier by automatically migrating data between the CPU and GPU.

Big Data Visualization with TACC Maverick

kelly

Over at VizWorld, Dean Meyers writes that the new Maverick system at TACC is a unique, powerful, high performance visualization and data analytics resource for the open science and engineering community.

Marc Hamilton Looks at China HPC

“Like the US, Japan, and Europe, China still has plans to build giant HPC systems like Tianhe. However, increasingly these systems are being looked at to support commercial HPC workloads like machine vision in a cloud environment in addition to just scientific data processing.”

GPU Acceleration Benefits for Applied CAE

Axel

“This presentation examines the HPC performance characteristics of CAE software, and the current state of GPU parallel solvers in commercial CAE that support product design in manufacturing industries. Case studies from industry will be presented that include HPC adoption of GPUs for production CAE and HPC technology and the benefits they provide. Rapid simulation from GPUs demonstrates the potential of a novel HPC technology that can transform current practices in engineering analysis and design optimization procedures.”

Ingram Micro Artizen Soutions for HPC

rick

In this video from the GPU Technology Conference, Rick Young from Ingram Micro describes the company’s Artizen HPC solutions. “Available now to channel partners in the U.S., the distributor’s new and exclusive line of Artizen High Performance Computing (HPC) offerings include turnkey high performance servers, ultimate workstations, and customizable supercomputing clusters, as well as computing integration and software installation services.”

Monitoring and Management Interfaces for GPU Devices in a Cluster Environment

sadaf

“This presentation will provide an overview of the Nvidia Tesla Deployment Kit (TDK) from a user and a system administrator point of view. TDL contains Nvidia Management Library (NVML) and nvidia-healthmon–a tool for detecting and troubleshooting known GPU issues in a cluster environment. Usage models within a cluster environment will be presented along with a discussion on how existing resource management tools can be extended to improve allocation and accounting of GPU resources.”

insideHPC Performance Guru Looks at Nvidia’s New NVLink

Bill D'Amico

“For NVLink to have its highest value it must function properly with unified memory. That means that the Memory Management Units in the CPUs have to be aware of NVLink DMA operations and update appropriate VM structures. The operating system needs to know when memory pages have been altered via NVLink DMA – and this can’t be solely the responsibility of the drivers. Tool developers also need to know details so that MPI or other communications protocols can make use of the new interconnect.”

Managing the GPUs of Your Cluster in a Flexible Way with rCUDA

rcuda

In this talk, we introduce the rCUDA remote GPU virtualization framework, which has been shown to be the only one that supports the most recent CUDA versions, in addition to leverage the InfiniBand interconnect for the sake of performance. Furthermore, we also present the last developments within this framework, related with the use of low-power processors, enhanced job schedulers, and virtual machine environments.”

High Performance Computing at CSCS

Schulthess

“With around 3.2 billion computer operations (3.2 gigaflops) per watt, the combination of GPUs CPUs makes “Piz Daint” one of the world’s most energy-­efficient supercomputers in the petaflop performance class.”

Monitoring and Management Interfaces for GPU Devices in a Cluster Environment

1529987_10153950357685284_1755270361_o

“This presentation will provide an overview of the Nvidia Tesla Deployment Kit (TDK) from a user and a system administrator point of view. TDL contains Nvidia Management Library (NVML) and nvidia-healthmon–a tool for detecting and troubleshooting known GPU issues in a cluster environment. Usage models within a cluster environment will be presented along with a discussion on how existing resource management tools can be extended to improve allocation and accounting of GPU resources.”