Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Slidecast: BigDL Open Source Machine Learning Framework for Apache Spark

In this video, Beenish Zia from Intel presents: BigDL Open Source Machine Learning Framework for Apache Spark. “BigDL is a distributed deep learning library for Apache Spark*. Using BigDL, you can write deep learning applications as Scala or Python* programs and take advantage of the power of scalable Spark clusters. This article introduces BigDL, shows you how to build the library on a variety of platforms, and provides examples of BigDL in action.”

Video: The Separation of Concerns in Code Modernization

In this video, Larry Meadows from Intel describes why modern processors require modern coding techniques. With vectorization and threading for code modernization, you can enjoy the full potential of Intel Scalable Processors. “In many ways, code modernization is inevitable. Even EDGE devices nowadays have multiple physical cores. And even a single-core machine will have hyperthreads. And keeping those cores busy and fed with data with Intel programming tools is the best way to speed up your applications.”

Appentra Auto-parallelization coming to Emerging Technologies Showcase at SC18

Today HPC Startup Appentra Solutions announced the company’s plans to showcase its auto-parallelization technologies at the Emerging Technologies Showcase at SC18. “SC18 is the premier international conference for High Performance Computing, networking, storage, and analysis. Every year, the Emerging Technologies program at the SC conference, showcases innovative solutions, from industry, government laboratories and academia, that may significantly improve and extend the world of HPC in the next five to fifteen years.”

Optimizing HPC Code with Roofline Analysis

In this special guest feature, James Reinders describes why roofline estimation is a great tool for code optimization in HPC. “As a long-time teacher of optimization techniques, I can confidently say that Roofline analysis is a must-have for anyone optimizing for performance. This has not always been the case. As I will explain, today it is an important technique to draw upon when doing performance optimization.”

ExaFLOW Project takes on High-order CFD

After three years of working on key algorithmic challenges in CFD, the European ExaFLOW Project it touting a series of industry milestones. With three flagship runs, ExaFLOW has managed to work on different specific CFD use cases which highlight the importance of their outcomes for both industry and academia. “After three years of working on key algorithmic challenges in CFD, the European ExaFLOW Project it touting a series of industry milestones. With three flagship runs, ExaFLOW has managed to work on different specific CFD use cases which highlight the importance of their outcomes for both industry and academia.”

Video: The March to Exascale

As the trend toward exascale HPC systems continues, the complexities of optimizing parallel applications running on them increase too. Potential performance limitations can occur at the application level which relies on the MPI. While small-scale HPC systems are more forgiving of tiny MPI latencies, large systems running at scale prove much more sensitive. Small inefficiencies can snowball into significant lag.

Ampere Augments ARM Servers with New Developer Platform

Today ARM server vendor Ampere introduced a new developer platform and program to facilitate the open development of next-generation cloud applications. Available through Lenovo, the new server platform with Ampere’s eMAG processors is available now for purchase and developers can order by visiting Ampere’s new developer web site. “By providing access to a development environment with open standards and full open documentation, we can expand the ecosystem and optimize the developer experience to innovate on cloud applications not yet imagined.”

Learn What to Do Next with Intel VTune Amplifier Application Performance Snapshot

Tuning code has, for a long time, been an art. Knowing what to look for and how to correct inefficiencies in serious numerical computations has not been easy for most programmers. It’s often hard to even know which tool to start with. Which is why the Intel® VTune™ Amplifier Application Performance Snapshot could prove to be a great way to get an instant summary of an application’s performance characteristics and issues.

Sylabs Tunes Singularity 3.0 Containers for Machine Learning

Today Sylabs announced that it has released a new version of its innovative container software: Singularity 3.0. With new enterprise-class features, Singularity is now the premier container runtime solution, enabling your company to seamlessly and efficiently tackle today’s most demanding AI, machine learning, and advanced analytic workloads. “Lenovo and Sylabs have collaborated for over two years around a shared vision of producing lightweight HPC containers that are flexible, secure, and reproducible, while also delivering performance that is equal to native OS performance.”

Maximum Performance, Minimum Effort: Intel® Performance Libraries

“Over two decades, Intel continued its efforts to refine libraries optimized to coax the greatest performance from Intel® processors. In this video, Noah Clemons, staff technical consulting engineer at Intel talks about the latest specialized libraries and their contributions for highly-optimized applications.”