Video: Debugging HPC Applications at Massive Scales

September 6, 2015 by Doug Black

In this video, LLNL scientists discuss the challenges of debugging programs at scale on the Sequoia supercomputer, which has 1.6 million processors.

Key insights:

Bugs in parallel HPC applications are difficult to debug because errors propagate among compute nodes, programmers must debug thousands of nodes or more, and bugs might manifest only at large scale.
Although conventional approaches like testing, verification, and formal analysis can detect a variety of bugs, they struggle at massive scales and do not always account for important dynamic properties of program execution.
Dynamic analysis tools and algorithms that scale with an application’s input and number of compute nodes can help programmers pinpoint the root cause of bugs.

Filed Under: Compute, Government, HPC Hardware, HPC Software, Industry Segments, Parallel Programming, Research / Education, Resources, Tools, Videos Tagged With: debuggers, LLNL, Sequoia supercomputer, Weekly Newsletter Articles

Energy efficiency drives HPC to the cloud

The high-performance computing (HPC) market is witnessing a notable shift towards the cloud, partially driven by the benefits of enhanced energy efficiency. According to Hyperion Research nearly every organization running HPC workloads is either already using or investigating the cloud to accelerate application performance, with the cloud market for HPC workloads forecast to reach $11.5 […]

Download

Comments

Aditya says

September 7, 2015 at 8:22 am

Gentlemen,

Consider trying out the Automatski AutoSIM IoT Simulator for debugging massive scale Algorithms. its an IoT simulator but can be easily repurposed for HPC debugging at the scale of billions of “REPEATABLE” & “DEBUGGABLE” events/second

Video: Debugging HPC Applications at Massive Scales

Sponsored Guest Articles

Accelerated HPC for Energy Efficiency with AWS and NVIDIA

White Papers

Energy efficiency drives HPC to the cloud

Comments

Featured RSS Feed

More News from insideBIGDATA

Video: Debugging HPC Applications at Massive Scales

Sponsored Guest Articles

Accelerated HPC for Energy Efficiency with AWS and NVIDIA

White Papers

Energy efficiency drives HPC to the cloud

Join Us On Social Media

Comments

Related Posts

Featured RSS Feed

More News from insideBIGDATA