“The majority of deep learning frameworks provide good out-of-the-box performance on a single workstation, but scaling across multiple nodes is still a wild, untamed borderland. This discussion follows the story of one researcher trying to make use of a significant compute resource to accelerate learning over a large number of CPUs. Along the way we note how to find good multiple-CPU performance with Theano* and TensorFlow*, how to extend a single-machine model with MPI and optimize its performance as we scale out and up on both Intel Xeon and Intel Xeon Phi architectures.”
Today Allinea released version 6.0 of their HPC development tools suite Allinea Forge and Performance Reports. Building on their commitment to serving the scientific HPC community, Allinea demonstrated the new features at SC15 last month in Austin.
In this podcast, the Radio Free HPC team shares their thoughts from SC15 in Austin. Henry is impressed by the increasing presence of FPGAs on the show floor. Dan is really impressed with the Allinea Performance Reports profiling tool and how easy it is to use. And Rich sees SC15 as the crossroads we’ll remember, where Intel, with the official launch of its Omni-Path Interconnect and Scalable System Framework, squared off against the co-design alliance of OpenPOWER with IBM, Mellanox, and Nvidia.
Allinea Software will demonstrate significant extensions to their Forge integrated development tool suite and Performance Reports analytics tool at SC15. “Version 6.0 of both products delivers for developers, users, analysts and system administrators – not only on Intel Xeon and Xeon Phi platforms, but also ARM 64-bit and OpenPOWER platforms.”
In this video from ISC 2015, Mark O’Connor demonstrates how the Allinea Performance Reports tool enables coders to speed up hydro, a real-world CFD code. “Allinea Performance Reports are the most effective way to characterize and understand the performance of HPC application runs. One single-page HTML report elegantly answers a range of vital questions for any HPC site.”
“In my humble opinion, I think that debuggers and profiling tools are far too infrequently used. And it’s not because they’re not there. It’s because people just either don’t know about them, don’t do training on them, or don’t know how to use them. We’re in a state where we have less cycles than we’ve ever had per request, right? So being able to take full advantage of those cycles by having optimized code and optimized run patterns is crucial. Otherwise, you’re just not going to be able to get your work done and the science won’t get done.”
Allinea Software and NCSA are enabling software heavily used in industry to run faster, creating competitive advantages for some of the nation’s largest companies.
In this video from SC14, Patrick Wohlschlegel from Allinea Software demonstrates how the company’s Performance Reports tool has been integrated into HP’s cluster management system for system administrators.
Scientists at the Council of Scientific and Industrial Research (CSIR) in India are accelerating their work thanks to profiling and debugging tools from Allinea Software.
“I have a few people helping out and tuning user applications, but we cannot serve all of our 500 users – we don’t have enough people. But if I have such a tool like Allinea Performance Reports, we tell them to try that out first and then if they have a problem, then they can do the first assessment themselves. Then we can use this tool to find the hard cases when my guys need to get involved. This is great.”