Increasing HPC Cluster Productivity Through System Resource Tracking

This white paper from Bright Computing, “Increasing HPC Cluster Productivity Through System Resource Tracking” addresses the necessary steps to give administrators, managers, and users the information they need to use HPC system resources effectively, to maximize system productivity, to enable effective resource sharing, to identify waste and to provide charge-back capability.

Podcast: Streamlined Data Science through Jupyter Lab and Jupyter Enterprise Gateway

“Jupyter is a free, open-source, interactive web tool known as a computational notebook, which researchers can use to combine software code, computational output, explanatory text and multimedia resources in a single document. This podcast looks at how the Bright Jupyter integration makes it easy for customers to use Bright for Data Science through JupyterLab notebooks, and allows users to run their notebooks through a supported HPC scheduler, Kubernetes, or on the server running JupyterHub.”

Job of the Week: R&D Operations and Maintenance Lead at Lockheed Martin

Lockheed Martin is seeking an R&D Operations and Maintenance Lead in our Job of the Week. “This position is the CSCF Program’s Operations and Maintenance Lead. This position is responsible for managing a small team of geographically diverse System Administrators in a Research and Development (R&D), Multi User High Performance Computer (HPC), Multi Level Secure (MLS) Data Center on a 5×12 schedule.”

Job of the Week: System Administrator at DE Shaw Research

DE Shaw Research is seeking System Administrators for Servers, Clusters and Supercomputers in our Job of the Week. “Exceptional sysadmins sought to manage systems, storage, and network infrastructure for a New York–based interdisciplinary research group. Ideal candidates should have strong fundamental knowledge of Linux concepts such as file systems, networking, and processes in addition to practical experience administering Linux systems.”

Podcast: Bright Computing forges eX3 at Simula Research Laboratory

The eX3 infrastructure allows Norwegian HPC researchers and their international collaborators to explore bleeding-edge hardware and software that will be instrumental to the coming generation of supercomputers. “Simula chose Bright Cluster Manager to provide comprehensive management of eX3, enabling the organization to administer its HPC platform as a single entity; provisioning the hardware, operating systems and workload managers from a unified interface.”

Job of the Week: Senior Level Unix/Linux Systems Engineer at Lockheed Martin Space Systems

Lockheed Martin Space is seeking a Senior Level Unix/Linux Systems Engineer in our Job of the Week. “The coolest jobs on this planet… or any other… are with Lockheed Martin Space. We are seeking a Multi-Level Security Subject Matter Expert for Unix/Linux/SE Linux/HPC System Engineers-Developers-Administrators and Research and Development HPC program team.”

Video: Energy Efficient Computing using Dynamic Tuning

Lubomir Riha from IT4Innovations gave this talk as part of the POP HPC webinar series. “This webinar focused on tools designed to improve the energy-efficiency of HPC applications using a methodology of dynamic tuning of HPC applications, developed under the H2020 READEX project. The READEX methodology has been designed for exploiting the dynamic behaviour of software. At design time, different runtime situations (RTS) are detected and optimized system configurations are determined. RTSs with the same configuration are grouped into scenarios, forming the tuning model. At runtime, the tuning model is used to switch system configurations dynamically.”

Altair offers free training tools and software licensing in response to COVID-19

Over at the Altair Blog, CEO James Scapa writes that the company is offering free training tools and software licensing in response to COVID-19. “Altair can provide temporary software licenses for clients working from home without access to their enterprise Altair software solutions with no additional fees or charges. If customers wish to move their HyperWorks Units from on-premises to Altair hosted servers or require temporary software licenses, they should contact their account representative.”

UKRI Awards ARCHER2 Supercomputer Services Contract

UKRI has awarded contracts to run elements of the next national supercomputer, ARCHER2, which will represent a significant step forward in capability for the UK’s science community. ARCHER2 is provided by UKRI, EPCC, Cray (an HPE company) and the University of Edinburgh. “ARCHER2 will be a Cray Shasta system with an estimated peak performance of 28 PFLOP/s. The machine will have 5,848 compute nodes, each with dual AMD EPYC Zen2 (Rome) 64 core CPUs at 2.2GHz, giving 748,544 cores in total and 1.57 PBytes of total system memory.”

Job of the Week: HPC Systems Engineer at William & Mary

William & Mary in Virginia seeks an HPC Systems Engineer in our Job of the Week. “The HPC Systems Engineer provides day-to-day operations, maintenance and hardware engineering support for computer systems managed by the Research Computing (RC) within the Office of Information Technology. These systems include Linux-based HPC clusters; departmental servers and workstations under the control of RC; These systems and the applications that run on them are characterized by leading-edge computing, networking, and software technologies, complex and demanding computational requirements, and large data-sets.”