Sign up for our newsletter and get the latest HPC news and analysis.

Monitoring and Management Interfaces for GPU Devices in a Cluster Environment

sadaf

“This presentation will provide an overview of the Nvidia Tesla Deployment Kit (TDK) from a user and a system administrator point of view. TDL contains Nvidia Management Library (NVML) and nvidia-healthmon–a tool for detecting and troubleshooting known GPU issues in a cluster environment. Usage models within a cluster environment will be presented along with a discussion on how existing resource management tools can be extended to improve allocation and accounting of GPU resources.”

This Week in HPC: HPCAC Swiss Conference and the HPC Search for Flight 370

twihpc

In this episode of This Week in HPC, Michael Feldman and Addison Snell from Intersect360 Research discuss the HPC Advisory Council Swiss conference. After the break, the topic turns to HPC and the Search for Flight 370.

Video: The OpenPower Initiative

gilad

“The OpenPOWER Foundation intends to build advanced server, networking, storage and acceleration technology aimed at delivering more choice, control and flexibility to developers of next-generation, hyperscale and cloud data centers.”

HPC Trends for 2014

Based on the latest survey data, Michael Feldman of Intersect360 Research will discuss the top trends of that emerged in 2013, with a look ahead to how the market will continue to evolve in the year ahead. Topics will include HPC in the Cloud, Big Data, evolutions in processor architectures, and the race to Exascale.

Video: ClusterStor Update

torben

“HPC storage solutions and futures continue to evolve as growth and performance requirements permeate every HPC market segment. Torben discusses these challenges and how the company’s storage solutions are addressing these shifting needs with new developments around disk drives, RAID, CIFS, security, small file handling, and other related technologies.”

Managing the GPUs of Your Cluster in a Flexible Way with rCUDA

rcuda

In this talk, we introduce the rCUDA remote GPU virtualization framework, which has been shown to be the only one that supports the most recent CUDA versions, in addition to leverage the InfiniBand interconnect for the sake of performance. Furthermore, we also present the last developments within this framework, related with the use of low-power processors, enhanced job schedulers, and virtual machine environments.”

DK Panda Presents: Programming Models for Exascale Systems

panda

“This talk will focus on programming models and their designs for upcoming exascale systems with millions of processors and accelerators. Current status and future trends of MPI and PGAS (UPC and OpenSHMEM) programming models will be presented. We will discuss challenges in designing runtime environments for these programming models by taking into account support for multi-core, high-performance networks, GPGPUs, Intel MIC, scalable collectives (multi-core-aware, topology-aware, and power-aware), non-blocking collectives using Offload framework, one-sided RMA operations, schemes and architectures for fault-tolerance/fault-resilience.”

Monitoring and Management Interfaces for GPU Devices in a Cluster Environment

1529987_10153950357685284_1755270361_o

“This presentation will provide an overview of the Nvidia Tesla Deployment Kit (TDK) from a user and a system administrator point of view. TDL contains Nvidia Management Library (NVML) and nvidia-healthmon–a tool for detecting and troubleshooting known GPU issues in a cluster environment. Usage models within a cluster environment will be presented along with a discussion on how existing resource management tools can be extended to improve allocation and accounting of GPU resources.”

InfiniBand Principles Every HPC Expert MUST Know

oded

In this video from the HPC Advisory Council Swiss Conference 2014, Oded Paz from Mellanox Global Education Services presents: InfiniBand Principles Every HPC Expert MUST Know.

GTC 2014 Keynote Time Lapse from insideHPC

timelapse

I produced this time lapse video featuring clips from the opening keynote at GTC 2014. “Nvidia really knows how to wow the audience with a blend of showmanship, state-of-the-art graphics, and dazzling technology.”