Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Altair Acquires HPC I/O Diagnostics Specialist Ellexus

Altair has announced its second acquisition in two days. The company today announced the acquisition of Ellexus, an input/output (I/O) analysis tool designed to help customers address issues quickly, improving speed accuracy and cloud readiness. Ellexus software products, Mistral and Breeze, are used for I/O diagnostics, optimization, and dependency detection by HPC administrators of large enterprises. […]

Second GPU Cloudburst Experiment Paves the Way for Large-scale Cloud Computing

Researchers at SDSC and the Wisconsin IceCube Particle Astrophysics Center have successfully completed a second computational experiment using thousands of GPUs across Amazon Web Services, Microsoft Azure, and the Google Cloud Platform. “We drew several key conclusions from this second demonstration,” said SDSC’s Sfiligoi. “We showed that the cloudburst run can actually be sustained during an entire workday instead of just one or two hours, and have moreover measured the cost of using only the two most cost-effective cloud instances for each cloud provider.”

NVIDIA Jarvis AI SDK Fuses Vision, Speech, and other Sensors into One System

“The NVIDIA Jarvis SDK offers a complete workflow to build, train and deploy GPU-accelerated AI systems that can use visual cues such as gestures and gaze along with speech in context. For example lip movement can be fused with speech input to identify the active speaker. Gaze can be used to understand if the speaker is engaging the AI agent or other people in the scene. Such multi-modal fusion enables simultaneous multi-user, multi-context conversations with the AI agent that need deeper understanding of the context.”

The Convergence of HPC and AI

In this special guest feature, Bill Wagner from Bright Computing writes that the convergence of HPC & AI presents new challenges for containers, job scheduling, and system management. “But here’s the rub … traditional HPC applications run under the jurisdiction of an HPC workload manager like Slurm or PBS Pro, whereas machine learning applications are primarily run in containers under the jurisdiction of a container orchestration system, such as Kubernetes.”

Video: UberCloud Containers on Kubernetes

Burak Yenier from the UberCloud gave this talk at the High Performance Container Workshop at ISC 2019. “This workshop will outline the current state of Linux Containers, what challenges are hindering the adoption in HPC/BigData and how containers can foster improvements when applied to the field of HPC, Big Data and AI in the mid- and long-term.”

Kmesh.io – Multicloud Lustre-as-a-Service

Vinay Gaonkar from Kmesh.io gave this talk at LUG 2019. “The need for cloud-based Lustre, he explained, is driven by both technological and business factors. In the end, this all adds up to a trend in which the cloud world is moving from heavy use of centralized data lakes to a much more flexible and responsive architecture of many smaller, distributed data ponds.”

Video: Kubernetes for Biomedical Analysis

Kevin Sayers from the University of Basel gave this talk at the hpc-ch forum. “The SIB Swiss Institute of Bioinformatics BioMedIT project is developing the computing infrastructure which will enable biomedical analyses on sensitive human data across multiple sites as part of the Swiss Personalized Health Network. This presentation will focus on our experience assessing Kubernetes to support these biomedical workloads, and the benefits it provides to researchers in the community.”

Sylabs boosts HPC Containers with SingularityPRO 3.1

Today Sylabs announced the release of SingularityPRO 3.1 in what the company is calling a watershed moment for enterprise customers everywhere. “SingularityPRO 3.1 is the most highly anticipated release of our enterprise software ever,” said Gregory Kurtzer, founder and CEO of Sylabs. “With this release, we’re rapidly advancing container science, making it a truly opportune time for those seeking to containerize the most demanding enterprise performance computing workloads in the most trusted way.”

Univa: Optimizing On-Premise Clusters and Migration to the Cloud

In this video from ISC 2018, Rob Lalonde describes how Univa products optimize on-premise clusters and migration to the Cloud. “Univa is the leading independent provider of software-defined computing infrastructure and workload orchestration solutions. Univa’s intelligent cluster management software increases efficiency while accelerating enterprise migration to hybrid clouds. We help hundreds of companies to manage thousands of applications and run billions of tasks every day.”

NVIDIA Releases Code for Accelerated Machine Learning

Today NVIDIA made a number of announcements centered around Machine Learning software at the Computer Vision and Pattern Recognition Conference in Salt Lake City. “NVIDIA is kicking off the conference by demonstrating an early release of Apex, an open-source PyTorch extension that helps users maximize deep learning training performance on NVIDIA Volta GPUs. Inspired by state of the art mixed precision training in translational networks, sentiment analysis, and image classification, NVIDIA PyTorch developers have created tools bringing these methods to all levels of PyTorch users.”