“Achieving good scalability on HPC scientific applications typically requires a good understanding of the workload, gained through profile analysis and by comparing behavior across different hardware to pinpoint bottlenecks in different areas of the HPC cluster. In this session, a selection of HPC applications will be shown to demonstrate various methods of profiling and analysis for determining the bottleneck, and the effectiveness of tuning in improving application performance.”
“The CMS detector at the Large Hadron Collider at CERN underwent a replacement of its data acquisition network to be able to process the increased data rate expected in the coming years. We will present the architecture of the system and discuss the design of its layers, which are based on InfiniBand as well as 10 and 40 Gbit/s Ethernet.”
“The use of GPUs to accelerate applications is mainstream nowadays, but their adoption in current clusters presents several drawbacks. In this talk we present the latest developments of the rCUDA remote GPU virtualization framework, which is the only one supporting the most recent CUDA version, in addition to leveraging the InfiniBand fabric for the sake of performance.”
“Adaptive Routing has been added to the static routing capability available in previous switch families. InfiniBand supports moving traffic via multiple parallel paths. Adaptive routing dynamically and automatically re-routes traffic to alleviate congested ports. In networks where traffic patterns are more predictable, static routing has been shown to produce superior results. The InfiniScale IV architecture provides the best of both static and adaptive routing.”
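To make the contrast in the abstract above concrete, here is a minimal toy sketch in Python. It is not how InfiniBand switches or the InfiniScale IV architecture implement routing. The function names, the modulo-based static mapping, and the "flows never finish" load model are all assumptions made for illustration only.

```python
def static_route(dest, num_paths):
    """Static routing (toy model): the path is a fixed function of the destination."""
    return dest % num_paths


def adaptive_route(load):
    """Adaptive routing (toy model): pick the least-loaded path, lowest index on ties."""
    return min(range(len(load)), key=lambda p: load[p])


def simulate(dests, num_paths, adaptive):
    """Place one flow per destination and return the resulting per-path load.

    Assumption: flows never complete, so load only accumulates.
    """
    load = [0] * num_paths
    for dest in dests:
        if adaptive:
            path = adaptive_route(load)
        else:
            path = static_route(dest, num_paths)
        load[path] += 1
    return load


# Hypothetical hot-spot pattern: most destinations map to the same static path.
hot_spot = [0, 4, 8, 12, 16, 20, 1, 2]
# Hypothetical predictable pattern: destinations spread evenly across paths.
uniform = [0, 1, 2, 3, 4, 5, 6, 7]

static_hot = simulate(hot_spot, 4, adaptive=False)
adaptive_hot = simulate(hot_spot, 4, adaptive=True)
static_uniform = simulate(uniform, 4, adaptive=False)

print("static, hot spot:  ", static_hot)
print("adaptive, hot spot:", adaptive_hot)
print("static, uniform:   ", static_uniform)
```

In this toy model the hot-spot pattern piles six flows onto one path under static routing, while the adaptive choice spreads them evenly. With the evenly spread pattern, static routing already balances the load without any runtime decisions, which is in line with the abstract's remark about predictable traffic.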
Watch to get the latest on the CORAL supercomputer coming to ORNL. “ORNL’s supercomputing program has grown from humble beginnings to deliver some of the most powerful systems in the world. On the way, it has helped researchers deliver practical breakthroughs and new scientific knowledge in climate, materials, nuclear science, and a wide range of other disciplines.”
CORAL (Collaboration of Oak Ridge, Argonne and Lawrence Livermore National Labs) is a project that was launched in 2013 to develop the technology and meet the Department of Energy’s 2017-2018 leadership computing needs with supercomputers. The collaboration between Mellanox, IBM and NVIDIA was selected by the CORAL project team after a comprehensive evaluation of future technologies from a variety of vendors. The development of these supercomputers is well underway with installation expected in 2017.