Untether AI Unveils At-Memory Compute Architecture at Hot Chips

PALO ALTO — Untether AI, an at-memory computation company for artificial intelligence (AI) workloads, today announced at the HOT CHIPS 2022 conference its next-generation architecture for accelerating AI inference workloads called speedAI devices, with an internal codename “Boqueria.” At 30 TeraFlops per watt (TFlops/W) and 2 PetaFlops of performance, the speedAI architecture sets a new […]

Verge.io Unveils Virtualized GPU Computing

ANN ARBOR, Mich.—August 16, 2022 — Verge.io, with a mission to offer a simpler way to virtualize data centers, has added new features to its Verge-OS software designed to give users the performance of GPUs as virtualized, shared resources. The intent is to create a cost-effective, simple and flexible way to perform GPU-based machine learning, remote […]

Google Cloud Says TPU-Powered Machine Learning Cluster Delivers 9 Exaflops Aggregate Power

As HPC enters the exascale era, watchers of the TOP500 list of the world’s most powerful supercomputers will look to see if the updated list, to be released this month at the ISC conference in Hamburg, will include systems that break the vaunted exaflops barrier. That said, Google Cloud yesterday unveiled what it called the […]

Composable Memory within CXL 2.0 Protocol Shown by Liqid, Samsung, Tanzanite

The Compute Express Link (CXL) Consortium is chasing the utopian tech dream, now being realized in an increasing number of high performance servers, of a high-speed, open-interface interconnect that enables the heterogeneous Babel of CPUs and accelerators to talk to each other, to all get along. It’s a critically important capability for AI, machine learning […]

HPE Launches 2 HPC-AI Offerings for ML Implementation and Collaboration

Two announcements today from HPE underscore the strategic imperative of combining HPC and AI — along with demand for systems that ease AI implementation complexity. HPE announced its Machine Learning Development System, designed to accelerate AI training models at scale. HPE said the system delivers value in days, rather than the typical weeks or months, […]

San Diego Supercomputer Center to Offer Two Summer Institutes

April 7, 2022 — The San Diego Supercomputer Center at UC San Diego has planned summer institutes for June and August, one focused on cyberinfrastructure-enabled machine learning and the other on high-performance computing (HPC) and data science. Application deadlines are April 15 and May 13, respectively. The Cyberinfrastructure-Enabled Machine Learning (CIML) Summer Institute will be held June 27-29 […]

MLPerf Results Highlight Advances in Machine Learning Inference Performance and Efficiency

SAN FRANCISCO – April 6, 2022 – Today MLCommons, an open engineering consortium, released new results for three MLPerf benchmark suites – Inference v2.0, Mobile v2.0, and Tiny v0.7. MLCommons said the three benchmark suites measure the performance of inference – applying a trained machine learning model to new data. Inference enables adding intelligence to a wide range […]

Solving AI Cluster Design Challenges with a Building Block Approach

[SPONSORED POST] When considering a large, complex system, such as an AI cluster, supercomputer, or compute cluster, you may think you have only two options: build from scratch, or buy the same pre-configured supercomputer-in-a-box from a major technology vendor that everyone else is buying. But there is a third option that takes a best-of-both-worlds approach: “building blocks” expertly designed around network, storage, and compute configurations that are balanced, but also flexible enough to provide scalability for your specific project needs.

Lenovo Helps Power NVIDIA’s Data-Center-Scale Omniverse Computing System for Industrial Digital Twins

March 22, 2022 – RESEARCH TRIANGLE PARK, N.C. – Today, at NVIDIA GTC, Lenovo (HKSE: 992) (ADR: LNVGY) announced its extended work with NVIDIA to deliver industry 3D simulation and design collaboration capabilities by providing the infrastructure for NVIDIA OVX, a computing system designed to run large-scale Omniverse digital twins. The collaboration leverages Lenovo infrastructure and […]