MailChimp Developer

Sign up for our newsletter and get the latest HPC news and analysis.
Send me information from insideHPC:


Arithmetic Intensity of Stencil Operations

F1.large

Applications that use 3D Finite Difference (3DFD) calculations are numerically intensive and can be optimized quite heavily to take advantage of accelerators that are available in today’s systems. The performance of an implementation can and should be optimized using numerical stencils. Choices made when designing and implementing algorithms can affect the Arithmetic Intensity (AI), which is a measure of how efficient an implementation, by comparing the flops and memory access.

OpenCL for Performance

OpenCL

“OpenCL is a fairly new programming model that is designed to help programmers get the most out of a variety of processing elements in heterogeneous environments. Many benchmarks that are available have demonstrated that excellent performance can be obtained over a wide variety of devices. Rather than lock an application into one specific accelerator, by using OpenCL, applications can be run over on a number of different architectures with each showing excellent speedups over a native (host cpu) implementation.”

Danish Meteorological to Install First Cray in Iceland

Cray Xc40

Today Cray announced that the Danish Meteorological Institute (DMI) has purchased a Cray XC supercomputer and a Cray Sonexion 2000 storage system. Through an arrangement with the Icelandic Meteorological Office (IMO), the system will be installed at the IMO datacenter in Reykjavik, Iceland for year-round power and cooling efficiency.

Training the Next Generation of Code Developers for HPC – Part 2

Rob Farber gives a tutorial at SC14

This is the second article in a two-part series about the challenges facing the HPC community in training people to write code and develop algorithms for current and future, massively-parallel, massive-scale HPC systems.

Climate Researchers Collaborate with HPC

BSC-ES_FORECASt

“The complexity and scale of weather and climate simulation have led weather centers and research groups to turn to their own community, either through direct collaboration or open source software initiatives, to increase performance and usability of these hugely complex models.”

Colfax to Offer Free Online Training for Intel Code Modernization

how

Today Colfax International announced free online workshops on parallel programming and optimization for Intel architecture, including Intel Xeon processors and Intel Xeon Phi coprocessors. “The Hands-on Workshop (HOW) series will introduce best practices to researchers and developers to efficiently extract maximum performance out of modern parallel processors, achieving shorter time to solution, higher research productivity, and future-proof design.”

Ray Tracing with Embree Kernels using Intel Xeon Phi

Embree

The Embree kernel approach, using the Intel Xeon Phi coprocessor is applicable to many situations. The implementation can be tuned to the hardware available, using different vector widths and workloads per ray. With a flexible toolkit for rendering, applications can take advantage of the latest hardware acceleration to achieve maximum performance.

Supermicro Deploys Sango Supercomputer at OIST in Japan

Okinawa Institute of Science & Technology Graduate University Sango Supercomputer (PRNewsFoto/Super Micro Computer, Inc.)

Today Super Micro announced the deployment of a new supercomputer in Japan at the Okinawa Institute of Science and Technology Graduate University (OIST) supporting scientists and researchers focused on discovery and innovation in the life, physical and environmental sciences.

Interview: Asetek Rides Rapid Adoption of Liquid Cooling for HPC

steve

The HPC industry’s expanded use of liquid cooling was evident at the recent ISC 2015 conference in Frankfurt. To learn more, we caught up with Steve Branton from Asetek.

Black-Scholes Pricing on Intel Xeon Phi

industry_finance-1024x768

“An expanding area of work both on the hardware front and the software side is to modify and optimize applications to run on both the host processor and a coprocessor. Many techniques to transform applications to reduce runtime have been discussed and implemented across a wide variety of applications.”