Search Results for: CPU

Exascale Computing Project: RAJA Portability Suite Enables Performance Portable CPU and GPU HPC Codes

A growing number of HPC applications must deliver high performance on CPU and GPU hardware platforms.  One software tool available now and showing tremendous promise for the exascale era is the open-source RAJA Portability Suite. RAJA is part of the Exascale Computing Project (ECP) NNSA software portfolio and is also supported by the ECP Programming Models and Runtimes area.

Rice Univ. Researchers Claim 15x AI Model Training Speed-up Using CPUs

Reports are circulating in AI circles that researchers from Rice University claim a breakthrough in AI model training acceleration – without using accelerators. Running AI software on commodity x86 CPUs, the Rice computer science team say  neural networks can be trained 15x faster than platforms utilizing GPUs. If valid, the new approach would be a double boon for organizations implementing AI strategies: faster model training using less costly microprocessors.

Los Alamos National Lab 1st to Receive Nvidia ‘Grace’ Arm CPU-based  Supercomputer

April 12, 2021 – Los Alamos National Laboratory will be the first United States customer to receive a supercomputer based on Nvidia’s new “Grace” Arm-based CPU, announced today, with Hewlett Packard Enterprise (HPE) as the system provider. Delivery is targeted for early 2023. The Grace CPU is an Arm-based processor built for data-intensive applications requiring […]

Nvidia Jumps into Data Center CPUs, Also Launches DPUs, GPUs and DGX Superpod for HPC and AI

Nvidia announced a slew of new products today on the first day of its annual GTC conference, including a first-time foray into data center CPU that has pushed Intel stock prices down and Nvidia shares up. Nvidia’s new “Grace” CPU, the company’s first CPU for the data center, is an Arm-based chip that the company […]

Advanced Clustering Technologies Announces ACTblade x3XX HPC Systems Based on 3rd Gen Intel Xeon CPUs 

April 6, 2021 — Advanced Clustering Technologies today announced the ACTblade x3XX family of dense HPC systems, which are based on the new third generation of Intel Xeon Scalable Processors, code named Ice Lake. Ice Lake is a significant advancement over the previous generation, offering more cores, more clock, more memory and faster memory. And the new ACTblades offer […]

Lenovo Launches Edge-to-Cloud Solutions with AMD EPYC 7003 CPUs

RESEARCH TRIANGLE PARK, NC – March 16, 2021 – Today, Lenovo (HKSE: 992) (ADR: LNVGY) Data Center Group (DCG) announced ThinkSystem and ThinkAgile hyperconverged infrastructure (HCI) solutions to support edge-to-cloud computing. These hybrid cloud solutions are designed to help organizations of all sizes, modernize, better secure their IT infrastructure, and deliver faster data insights – […]

Deci and Intel Collaborate to Optimize Deep Learning Inference on Intel’s CPUs

Deci, the deep learning company building the next generation of AI, announced a broad strategic business and technology collaboration with Intel Corporation to optimize deep learning inference on Intel Architecture (IA) CPUs. As one of the first companies to participate in Intel Ignite startup accelerator, Deci will now work with Intel to deploy innovative AI technologies to mutual customers.

AMD Launches 7nm ‘Milan’ EPYC Chips, Ups Price/Performance Ante for HPC, Data Center CPUs

Striving to continue on its path back to HPC and data center processor market prominence, AMD this morning introduced a new series of EPYC “Milan” CPUs that industry analysts and customers say looks to be a price/performance juggernaut. The EPYC 7003 series (SKUs below), designed by AMD and fabricated using Taiwan Semiconductor Manufacturing Co.’s 7-nanometer […]

Spotting HPC and Exascale Bottlenecks with TAU CPU/GPU/MPI Profiler

Programmers cannot blindly guess which sections of their code might bottleneck performance. This problem is worsened when codes run across the variety of hardware platforms supported by the Exascale Computing Project (ECP). A section of code that runs well on one system might be a bottleneck on another system. Differing hardware execution models further compound the performance challenges that face application developers; these models can include the somewhat restricted SIMD (Single Instruction Multiple Data) and SIMT (Single Instruction Multiple Thread) computing for GPU models and the more complex and general MIMD (Multiple Instruction Multiple Data) for CPUs. New software programming models, such as Kokkos, also introduce multiple layers of abstraction and lambda functions that can hide or obscure the low-level execution details due to their complexity and anonymous nature. Differing memory systems inside a node and differences in the communications fabric that connect high-performance computing (HPC) nodes in a distributed supercomputer environment add even greater challenges in identifying performance bottlenecks during application performance analysis.

University of Stuttgart’s Hawk HPC System to Go CPU-GPU for Deep Learning Workloads

Add the High Performance Computing Center at the University of Stuttgart (HLRS) to the list of supercomputing organizations going from CPU-only to CPU-GPU architectures. HLRS announced this morning it will add Nvidia graphic processing units to its Hawk supercomputer, a Hewlett Packard Enterprise Apollo system installed last February. One of Europe’s most powerful HPC systems, […]