In this slidecast, David Eaton and Mark Tellez from SRC Computer describe the company’s new Saturn 1 server, a dynamically reconfigurable modular system for hyperscale data centers and Web operations.
“This talk will discuss various strategies to make a serial Python code faster, for example using libraries like NumPy, or tools like Cython which compile Python code. The talk will also discuss the available tools for running Python in parallel, focusing on the mpi4py module which implements MPI (Message Passing Interface) in Python.”
“Learn about extensions that enable efficient use of Partitioned Global Address Space (PGAS) Models like OpenSHMEM and UPC on supercomputing clusters with NVIDIA GPUs. PGAS models are gaining attention for providing shared memory abstractions that make it easy to develop applications with dynamic and irregular communication patterns. However, the existing UPC and OpenSHMEM standards do not allow communication calls to be made directly on GPU device memory. This talk discusses simple extensions to the OpenSHMEM and UPC models to address this issue.”
“Our computing systems continue to evolve, providing significant challenges to the programming teams managing large, long-lived projects. Issues include rapidly increasing on-node parallelism, varying forms of heterogeneity, deepening memory hierarchies, growing concerns around resiliency and silent data corruption, and worsening storage bottlenecks.”
Learn how OpenACC runtimes also exposes performance-related information revealing where your OpenACC applications are wasting clock cycles. The talk will show that profilers can connect with OpenACC applications to record how much time is spent in OpenACC regions and what device activity it turns into.