Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Job of the Week: HPC User Support Technician at NREL

NREL is seeking an HPC User Support Technician in our Job of the Week. “Your job will be front-line support for the High Performance Computing user community which includes scientists, researchers, and students at many levels of education and experience.”

Podcast: Software Deployment and Continuous Integration for Exascale

In this Let’s Talk Exascale podcast,  Ryan Adamson from Oak Ridge National Laboratory describes how his role at the Exascale Computing Project revolves around software deployment and continuous integration at DOE facilities. “Each of the scientific applications that we have depends on libraries and underlying vendor software,” Adamson said. “So managing dependencies and versions of all of these different components can be a nightmare.”

Altair PBS Works Steps Up to Exascale and the Cloud

In this video from SC19, Sam Mahalingam from Altair describes how the company is enhancing PBS Works software to ease the migration of HPC workloads to the Cloud. “Argonne National Laboratory has teamed with Altair to implement a new scheduling system that will be employed on the Aurora supercomputer, slated for delivery in 2021. PBS Works runs big — 50,000 nodes in one cluster, 10,000,000 jobs in a queue, and 1,000 concurrent active users.”

Job of the Week: Sr. HPC System Administrator at ASRC Federal

ASRC Federal in Maryland is seeking a Sr. HPC System Administrator in our Job of the Week. “This position is a member of an HPC Support team focusing on storage hardware and software for two supercomputing clusters. You will specialize in both the monitoring and management of storage systems and storage-related network management for a large supercomputer.”

University of Siegen Doubles Down on Bright Computing for HPC Management

Bright Computing recently announced that the University of Siegen has expanded its use of the company’s HPC system management software to address the increasingly varied and challenging demands from its research staff. “We have had nothing but positive experiences with Bright Cluster Manager and the Bright support team,” said Daniel Harlacher, HPC team leader at University of Siegen. “I would highly recommend the technology. Bright increases the performance of my team by automating many everyday tasks, freeing my team to concentrate on delivering exceptional service to our university departments.”

Podcast: SCR Scalable Checkpoint/Restart Paves the Way for Exascale

A software product called the Scalable Checkpoint/Restart (SCR) Framework 2.0 recently won an R&D 100 Award. In this episode, Elsa Gonsiorowski and Kathryn Mohror of LLNL discuss what SCR does, the challenges involved in creating it, and the impact it is expected to have in HPC. “SCR enables HPC simulations to take advantage of hierarchical storage systems, without complex code modifications. With SCR, the input/output (I/O) performance of scientific simulations can be improved by orders of magnitude.”

Moving HPC to the Cloud with Easy8 and Bright Cluster Manager

In this video from SC19, Bill Wagner from Bright Computing announces the latest release of Bright Cluster Manager software for HPC, OpenStack, and Data Science. Now available for free for clusters up to eight nodes with the Easy8 program, Bright Cluster Manager simplifies building and managing Linux clusters from the core to the cloud and to the edge. “We are excited about the new features in 9.0,” said Bill Wagner, CEO of Bright Computing. “Our development team has been hard at work implementing a host of features designed to better extend on-premise clusters to the public cloud and edge, improve ease of use, lower administrative costs, and increase standardization across the enterprise.”

LIqid Steps up with Composable Infrastructure at SC19

In this video from SC19, Sumit Puri from Liqid describes the company’s innovative composable infrastructure technology for HPC. “We don’t build servers statically. We build servers dynamically by taking software and reconfiguring servers on the fly to have any amount of storage, GPU, networking, or compute that the application layer requires. Our mission is to turn the data center from statically configured to dynamically configurable.”

Job of the Week: Systems Administrators for Servers, Clusters and Supercomputers at D.E. Shaw Research

D.E. Shaw Research is seeking Systems Administrators for Servers, Clusters and Supercomputers in our Job of the Week. “Our research effort is aimed at achieving major scientific advances in the field of biochemistry and fundamentally transforming the process of drug discovery. Exceptional sysadmins sought to manage systems, storage, and network infrastructure for a New York–based interdisciplinary research group.”

Job of the Week: HPC System Administrator at UC Riverside

The University of California, Riverside is seeking an HPC System Administrator in our Job of the Week. UC Riverside’s (UCR) research computing infrastructure is provided by a central HPC facility. This facility operates Linux clusters with over 4,500 CPU cores, 50TB of total system RAM and several GPU nodes. Big Data storage is handled by a centralized GPFS-based storage cluster with over 3PB of disk space for production and backup storage. The incumbent will be part of a team responsible for the systems administration of this HPC infrastructure, including development of software for parallel computing, network management, data security and user training.”