In this video from the HPCAC Swiss Conference 2014, Fisnik Kraja from science + computing ag presents: Performance Analysis and Optimizations of CAE Applications.
The Holland Computing Center HPC support team at the University of Nebraska is using Allinea Software’s unified debugging and profiling tools to help users speed up their scientific computing. With the need to support four clusters and total of 20,000 compute cores, Nebraska’s HPC support team provides assistance to many users of large MPI applications […]
Over at the ISC Blog, Bernd Mohr writes that HPC programmers are still in demand thanks to the complexity of today’s hybrid supercomputers. So, in reality programming HPC systems is still very complex and if you manage to create a correctly working program, it does not automatically mean you get a very efficiently working program. […]
“The Relion 2808GT was designed to deliver maximum compute power with a minimal datacenter footprint and maximum efficiency. The system supports eight GPUs or coprocessors in two rack units and provides a higher compute density that any other server on the market. Configured with eight NVidia GPUs or eight Intel Xeon Phi co-processors a single Relion 2808GT can achieve over 8TFLOPs of double precision floating point peak performance.”
“Verbs and RDMA-based networks will continue to be important focus areas, but with the new OFWG, the OFA is expanding its focus to include additional application-centric APIs that could be better suited to other areas and technologies,” said Jim Ryan, chairman, OFA. “Application-centric I/O is an extension of the approach historically taken by the OFA, and we will use our expertise in this area to guide the open source development of APIs and I/O libraries.”
“This talk describes an experimental methodology, ParalleX, that addresses Exascale challenges through a change in the fundamental model of parallel computation from that of the communicating sequential processes (e.g., MPI) to an innovative synthesis of concepts involving message-driven work-queue execution in the context of a global address space.”