Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Podcast: Delivering Exascale Machine Learning Algorithms at the ExaLearn Project

In this Let’s Talk Exascale podcast, researchers from the ECP describe progress at the ExaLearn project. ExaLearn is focused on ML and related activities to inform the requirements for these pending exascale machines. “ExaLearn’s algorithms and tools will be used by the ECP applications, other ECP co-design centers, and DOE experimental facilities and leadership-class computing facilities.”

Software-defined Microarchitecture: An Arguably Terrible Idea, But Certainly Not The Worst Idea

James Mickens from Harvard University gave this talk at HiPEAC 2020. “In this presentation, I will describe some of the benefits that would emerge from a new kind of processor that aggressively exposes microarchitectural state and allows it to be programmed. Using elaborate hand gestures and cheap pleas for sympathy, I will explain why my proposals are different than prior “open microarchitecture” ideas like transport-triggered designs.”

Precision Medicine pushes demand for HPC at the Edge: AI on the Fly ® Delivers

In this special guest feature, Tim Miller from One Stop Systems writes that by bringing specialized, high performance computing capabilities to the edge through AI on the Fly, OSS is helping the industry deliver on the enormous potential of precision medicine. “The common elements of these solutions are high data rate acquisition, high speed low latency storage, and efficient high performance compute analytics. With OSS, all of these building block elements are connected seamlessly with memory mapped PCI Express interconnect configured and customized as appropriate, to meet the specific environmental requirements of ‘in the field’ installations.”

Podcast: One Big Debate over OneAPI

In this podcast, the Radio Free HPC team looks at Intel’s oneAPI project. “The OneAPI project is a highly ambitious initiative; trying to design a single API to handle CPUs, GPUs, FPGAs, and other types of processors. In the discussion, we look under the hood and see how this might work. One thing working in Intel’s favor is that they’re using data parallel C++, which is highly compatible with CUDA – and which is probably Intel’s target with this new initiative.”

How Supersonic Commercial Flight is Possible with Big Compute

In this video from Big Compute 2020, Blake Scholl from Boom Supersonic describes how high performance computing in the cloud has opened a new era of high-speed flight. “We’ve done about 66 million core hours of computing, mainly through Rescale since we started the design effort on XB-1. And if you asked yourself what that would look like and wind tunnel testing, it would be financially and timewise just absolutely impractical.”

Latest Release of Intel Parallel Studio XE Delivers New Features to Boost HPC and AI Performance

Intel Parallel Studio XE is a complete software development suite that includes highly optimized compilers and math and data analytics libraries, along with comprehensive tools for performance analysis, application debugging, and parallel processing. It’s available as a download for Windows, Linux, and MacOS. “With this release, the focus is on making it easier for HPC and AI developers to deliver fast and reliable parallel code for the most demanding applications.”

Podcast: Applying Deep Learning to Extreme Weather

“A research team from Rice University utilized three supercomputers (TACC’s Stampede 2, Wrangler, and Pittsburg Supercomputing Center’s Bridges system) to see if data on heat waves and cold spells could be predicted by analysis of atmospheric circulation and prior surface temperature. The results of these tests indicated that this deep learning approach is more accurate at predicting extreme weather.”

Podcast: How Community Collaboration Drives Compiler Technology at the LLVM Project

In this Let’s Talk Exascale podcast, Hal Finkel of Argonne National Laboratory describes how community collaboration is driving compiler infrastructure at the LLVM project. “LLVM is important to a wide swath of technology professionals. Contributions shaping its development have come from individuals, academia, DOE and other government entities, and industry, including some of the most prominent tech companies in the world, both inside and outside of the traditional high-performance computing space.”

Video: Next Generation AMD CPU and Accelerator Technologies

Bradley McCredie from AMD gave this talk at the Rice Oil & Gas Conference. “While foundries bravely drive forward to overcome the technical and economic challenges posed by scaling to 5nm and beyond, Moore’s law alone can provide only a fraction of the performance / watt and performance / dollar gains needed to satisfy the demands of today’s high performance computing and artificial intelligence applications. To close the gap, multiple strategies are required. First, new levels of innovation and design efficiency will supplement technology gains to continue to deliver meaningful improvements in SoC performance. Second, heterogenous compute architectures will create x-factor increases of performance efficiency for the most critical applications.”

Podcast: Slingshotting to Exascale

In this podcast, the Radio Free HPC team looks at the Cray Slingshot interconnect that will power all three of the first Exascale supercomputers in the US. “At the heart of this new interconnect is their innovative 64 port switch that provides a maximum of 200 Gb/s per port and can support Cray’s enhanced Ethernet along with standard Ethernet message passing. It also has advanced congestion control and quality of service modes that ensure that each job gets their right amount of bandwidth.”