Podcast: DoE Awards $258 Million for Exascale to U.S. HPC Vendors

Today U.S. Secretary of Energy Rick Perry announced that six leading U.S. technology companies will receive funding from the Department of Energy’s Exascale Computing Project (ECP) as part of its new PathForward program, accelerating the research necessary to deploy the nation’s first exascale supercomputers. “Continued U.S. leadership in high performance computing is essential to our security, prosperity, and economic competitiveness as a nation,” said Secretary Perry. “These awards will enable leading U.S. technology firms to marshal their formidable skills, expertise, and resources in the global race for the next stage in supercomputing—exascale-capable systems.”

Mellanox InfiniBand Delivers up to 250 Percent Higher ROI for HPC

Today Mellanox announced that its EDR 100Gb/s InfiniBand solutions have demonstrated 30 to 250 percent higher HPC application performance than Omni-Path. The performance tests were conducted at end-user installations and at the Mellanox benchmarking and research center, and covered a variety of HPC application segments, including automotive, climate research, chemistry, bioscience, and genomics.

GEN-Z: An Overview and Use Cases

Greg Casey from Dell EMC presented this talk at the OpenFabrics Workshop. “This session will focus on the new Gen-Z memory-semantic fabric. The speaker will show the audience why Gen-Z is needed, how Gen-Z operates, what is expected in first products that employ Gen-Z, and encourage participation in finalizing the Gen-Z specifications. Gen-Z will be connecting components inside of servers as well as connecting servers with pools of memory, storage, and acceleration devices through a switch environment.”

Radio Free HPC Looks at Azure’s Move to GPUs and OCP for Deep Learning

In this podcast, the Radio Free HPC team looks at a set of IT and Science stories. Microsoft Azure is making a big move to GPUs and the OCP Platform as part of their Project Olympus. Meanwhile, Huawei is gaining market share in the server market and IBM is bringing storage to the atomic level.

AMD Unveils Vega GPU Architecture with HBM Memory

Today AMD unveiled preliminary details of its forthcoming GPU architecture, Vega. Conceived and executed over five years, the Vega architecture enables new possibilities in PC gaming, professional design and machine intelligence that traditional GPU architectures have not been able to address effectively. “It is incredible to see GPUs being used to solve gigabyte-scale data problems in gaming to exabyte-scale data problems in machine intelligence. We designed the Vega architecture to build on this ability, with the flexibility to address the extraordinary breadth of problems GPUs will be solving not only today but also five years from now. Our high-bandwidth cache is a pivotal disruption that has the potential to impact the whole GPU market,” said Raja Koduri, senior vice president and chief architect, Radeon Technologies Group, AMD.

New AMD Radeon Instinct Rolls Out to Accelerate Machine Intelligence

“New Radeon Instinct accelerators will offer organizations powerful GPU-based solutions for deep learning inference and training. Along with the new hardware offerings, AMD announced MIOpen, a free, open-source library for GPU accelerators intended to enable high-performance machine intelligence implementations, and new, optimized deep learning frameworks on AMD’s ROCm software to build the foundation of the next evolution of machine intelligence workloads.”

HIP and CAFFE Porting and Profiling with AMD’s ROCm

In this video from SC16, Ben Sander from AMD presents: HIP and CAFFE Porting and Profiling with AMD’s ROCm. “We are excited to present ROCm, the first open-source HPC/Hyperscale-class platform for GPU computing that’s also programming-language independent. We are bringing the UNIX philosophy of choice, minimalism and modular software development to GPU computing. The new ROCm foundation lets you choose or even develop tools and a language run time for your application. ROCm is built for scale; it supports multi-GPU computing in and out of server-node communication through RDMA.”

Radio Free HPC Looks into the New OpenCAPI Consortium

In this podcast, the Radio Free HPC team looks at the new OpenCAPI interconnect standard. “Released this week by the newly formed OpenCAPI Consortium, OpenCAPI provides an open, high-speed pathway for different types of technology – advanced memory, accelerators, networking and storage – to more tightly integrate their functions within servers. This data-centric approach to server design, which puts the compute power closer to the data, removes inefficiencies in traditional system architectures to help eliminate system bottlenecks and can significantly improve server performance.”

AMD GPUs to Speed Alibaba Cloud

Today AMD announced that the Alibaba Cloud will use AMD Radeon Pro GPU technology to help expand its cloud computing offerings and accelerate adoption of its cloud-based services. “The partnership between AMD and Alibaba Cloud will bring both of our customers more diversified, cloud-based graphic processing solutions. It is our vision to work together with leading technology firms like AMD to empower businesses in every industry with cutting-edge technologies and computing capabilities,” said Simon Hu, president of Alibaba Cloud.

New OpenCAPI Consortium to Boost Server Performance 10x

“IBM has decided to double down on our commitment to open standards and enablement of industry innovation by opening up access to our CAPI technology to the entire industry. With the support of our OpenCAPI co-founders, we have created a new OpenCAPI specification that tremendously improves performance over our prior specification and IBM will be among the first to implement it with our POWER9 products expected in 2017.”