Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


GPUs & Deep Learning in the Spotlight for Nvidia at SC16

In this video from SC16, Roy Kim from Nvidia describes how the company is bringing in a new age of AI with accelerated computing for Deep Learning applications. “Deep learning is the fastest-growing field in artificial intelligence, helping computers make sense of infinite amounts of data in the form of images, sound, and text. Using multiple levels of neural networks, computers now have the capacity to see, learn, and react to complex situations as well or better than humans. This is leading to a profoundly different way of thinking about your data, your technology, and the products and services you deliver.”

HPE Apollo 6500 for Deep Learning

“With up to eight high performance NVIDIA GPUs designed for maximum transfer bandwidth, the HPE Apollo 6500 is purpose-built for HPC and deep learning applications. Its high ratio of GPUs to CPUs, dense 4U form factor and efficient design enable organizations to run deep learning recommendation algorithms faster and more efficiently, significantly reducing model training time and accelerating the delivery of real-time results, all while controlling costs.”

Kx Streaming Analytics Crunches 1.2 Billion NYC Taxi Data Points using Intel Xeon Phi

“The complexity and high costs of architecting and maintaining streaming analytics solutions often make it difficult to get new projects off the ground. That’s part of the reason Kx, a leading provider of high-volume, high-performance databases and real-time analytics solutions, is always interested in exploring how new technologies may help it push streaming analytics performance and efficiency boundaries. The Intel Xeon Phi processor is a case in point. At SC16 in Salt Lake City, Kx used a 1.2 billion record database of New York City taxi cab ride data to demonstrate what the Intel Xeon Phi processor could mean to distributed big data processing. And the potential cost/performance implications were quite promising.”

Cray Sets New Supercomputing Record with HLRS and Ansys

ANSYS, HLRS and Cray have pushed the boundaries of supercomputing by achieving a new supercomputing milestone by scaling ANSYS software to 172,032 cores on the Cray XC40 supercomputer, hosted at HLRS, running at 82 percent efficiency. This is nearly a 5x increase over the record set two years ago when Fluent was scaled to 36,000 cores. “This record-setting scaling of ANSYS software on the Cray XC40 supercomputer at HLRS proves that close collaborations with customers and partners can produce exceptional results for running complex simulations,” said Fred Kohout, senior vice president and chief marketing officer at Cray.

HIP and CAFFE Porting and Profiling with AMD’s ROCm

In this video from SC16, Ben Sander from AMD presents: HIP and CAFFE Porting and Profiling with AMD’s ROCm. “We are excited to present ROCm, the first open-source HPC/Hyperscale-class platform for GPU computing that’s also programming-language independent. We are bringing the UNIX philosophy of choice, minimalism and modular software development to GPU computing. The new ROCm foundation lets you choose or even develop tools and a language run time for your application. ROCm is built for scale; it supports multi-GPU computing in and out of server-node communication through RDMA.”

RAID Inc. Steps up with ZFS on Lustre at SC16

In this video from SC16, Brad Merchant from RAID Inc. describes the company’s new Lustre ZFS Building Block. “RAID Inc. offers a suite of building block product families that can be purchased individually or in conjunction with other RAID products to solve customer’s needs in the most demanding data-storage environments. Each product is customized to address customer’s individual requirements of performance, reliability, scalability and price. Each product is put through extensive testing and a burn-in/staging process which ensures customers will receive a solution designed to function as specified in their unique environment.”

DDN IME Burst Buffer Exceeds 1 TB/s for Japan’s Fastest Supercomputer

“Storage performance has been one of the biggest challenges in developing supercomputers. To meet the demands for storage performance, IME was introduced to the Oakforest-PACS on a massive scale, the first such introduction in the world,” said Osamu Tatebe, lead, public relations, JCAHPC / professor, Center for Computational Sciences, University of Tsukuba. “We are very pleased that we could achieve effective I/O performance exceeding 1 TB per second in writing tens of thousands of processes to the same file. With this new storage technology, we believe that we will be able to contribute to society with the further development of computational science, big data analysis and machine learning.”

Silicon Mechanics: HPC Built for You at SC16

In this video from SC16, Silicon Mechanics CTO Daniel Chow describes how the company brings value and performance to its HPC customers. “When looking for a leading solutions integrator to couple disparate hardware and software products into a “HPC Built For You” solution, that will keep up with the evolution and disruptive forces in technology – the Experts at Silicon Mechanics are here to help you.”

DDN Re-Energizes HPC Workflows at SC16

“At SC16, DDN demonstrated centralized storage implementations that accelerate financial and scientific analytics by 5x – extracting maximum efficiency, value and ROI in real-world customer environments using DDN products and solutions with open and commercial software including OpenStack and Hadoop. In addition to the demonstrations, DDN also highlighted its customer use cases with other open and commercial software such as Informatica, SAS GRID, R and Ab Initio, among others.”

Huawei Showcases HPC Solutions at SC16

“Huawei has increasingly become more prominent in the HPC market. It has successfully deployed HPC clusters for a large number of global vehicle producers, large-scale supercomputing centers, and research institutions. These show that Huawei’s HPC platforms are optimized for industry applications which can help customers significantly simplify service processes and improve work efficiency, enabling them to focus on product development and research.”