matrix Archives - High-Performance Computing News Analysis

Heterogeneous Streams with Intel Xeon Phi

January 21, 2016 by MichaelS

Matrix multiplies can be decomposed into tiles and executed efficiently on the latest generations of coprocessors. Intel has developed the hStreams library, which supports task concurrency on heterogeneous platforms. The concurrency may be across nodes (Xeon, KNC, KNL-SB, KNL-LB), within a node for small matrix operations, and in the overlap of computation and communication, particularly for tiled solutions. The library relieves the user of the complexity of thread affinitization, offloading, memory types, and memory affinitization.
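To make the tiling idea concrete, here is a minimal sketch in plain Python of a matrix multiply decomposed into tiles. It does not use hStreams or any of its API; the function name `tiled_matmul` and the `tile` parameter are assumptions made for illustration. Each iteration of the three outer loops is one tile-level task, which is the kind of unit a streaming runtime could schedule concurrently or overlap with data transfers.

```python
def tiled_matmul(a, b, tile=2):
    """Multiply a (n x k) by b (k x m), one tile-sized block at a time.

    Illustrative only: the name and the tile parameter are hypothetical,
    and the tiles run sequentially here rather than concurrently.
    """
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for i0 in range(0, n, tile):          # tile row of C
        for j0 in range(0, m, tile):      # tile column of C
            for p0 in range(0, k, tile):  # tile along the shared dimension
                # One tile task: C[i0.., j0..] += A[i0.., p0..] * B[p0.., j0..]
                for i in range(i0, min(i0 + tile, n)):
                    for p in range(p0, min(p0 + tile, k)):
                        a_ip = a[i][p]
                        for j in range(j0, min(j0 + tile, m)):
                            c[i][j] += a_ip * b[p][j]
    return c
```

Tasks that update different tiles of the result are independent of one another, so in principle they can run at the same time; tasks that accumulate into the same tile must be ordered or reduced.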

Filed Under: Coprocessors, HPC Hardware, HPC Software, Main Feature, Parallel Programming, Processors, Sponsored Post
Tagged With: Intel, Intel TEC, intel xeon, Intel Xeon Phi, matrix, streams

Morton Ordering on the Intel Xeon Phi

October 8, 2015 by MichaelS

The Morton order is a mapping of multidimensional data to one dimension that preserves locality of the data. This is also known as Z-order. “By using Morton ordering as an alternative to row-major or column-major data storage, significant speedups can be achieved on the Intel Xeon Phi coprocessor or Intel Xeon CPU when performing matrix multiplies or matrix transposes.”
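As a rough illustration of the mapping, the sketch below computes a Morton (Z-order) index by interleaving the bits of a row and a column, then uses it to reorder a small square matrix into a flat list. The function names are hypothetical, the inputs are assumed to fit in 16 bits each, and the matrix side is assumed to be a power of two; this is not the code from the article.

```python
def part1by1(x):
    """Spread the low 16 bits of x so that a zero bit sits between each pair."""
    x &= 0x0000FFFF
    x = (x | (x << 8)) & 0x00FF00FF
    x = (x | (x << 4)) & 0x0F0F0F0F
    x = (x | (x << 2)) & 0x33333333
    x = (x | (x << 1)) & 0x55555555
    return x


def morton_index(row, col):
    """Interleave row and column bits; row bits occupy the odd positions."""
    return (part1by1(row) << 1) | part1by1(col)


def to_morton_layout(matrix):
    """Copy a square matrix (side a power of two) into a flat Z-ordered list."""
    n = len(matrix)
    flat = [None] * (n * n)
    for r in range(n):
        for c in range(n):
            flat[morton_index(r, c)] = matrix[r][c]
    return flat
```

With this layout, each aligned 2x2 block of the matrix occupies four consecutive slots in the flat list, and each aligned 4x4 block occupies sixteen, which is where the locality benefit comes from.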

Filed Under: HPC Software, Main Feature, Parallel Programming, Sponsored Post
Tagged With: intel xeon, Intel Xeon Phi, matrix, matrix transpose, MIC, morton reordering

Sparse Matrix Multiplication

October 1, 2015 by MichaelS

“A parallel implementation of SpMV can be implemented, using OpenMP directives. However, by allocating memory for each core, data races can be eliminated and data locality can be exploited, leading to higher performance. Besides running on the main CPU, vectorization can be implemented on the Intel Xeon Phi coprocessor. By blocking the data in various chunks, various implementations on the Intel Xeon Phi coprocessor can be run and evaluated.”
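The partitioning idea in the quote can be sketched as follows, under several assumptions: the matrix is stored in compressed sparse row (CSR) form, rows are split into fixed-size blocks, and each block is computed into its own private output list so that no two workers write to the same memory. The function names and the `block_rows` and `workers` parameters are hypothetical. Python threads are used only to show the structure; they would not deliver the speedup that OpenMP threads give on real hardware.

```python
from concurrent.futures import ThreadPoolExecutor


def spmv_rows(values, col_idx, row_ptr, x, start, stop):
    """Compute y[start:stop] for a CSR matrix into a private list."""
    out = []
    for r in range(start, stop):
        acc = 0.0
        # Nonzeros of row r live in values[row_ptr[r]:row_ptr[r + 1]].
        for k in range(row_ptr[r], row_ptr[r + 1]):
            acc += values[k] * x[col_idx[k]]
        out.append(acc)
    return out


def spmv_blocked(values, col_idx, row_ptr, x, block_rows=2, workers=2):
    """Sparse matrix-vector product, one task per block of rows."""
    n_rows = len(row_ptr) - 1
    starts = range(0, n_rows, block_rows)

    def run_block(start):
        stop = min(start + block_rows, n_rows)
        return spmv_rows(values, col_idx, row_ptr, x, start, stop)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() yields results in block order, so concatenation rebuilds y.
        parts = list(pool.map(run_block, starts))
    return [v for part in parts for v in part]
```

Because each block owns a disjoint range of rows, the only shared data are the read-only matrix arrays and the input vector, so no locking is needed.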

Filed Under: Coprocessors, HPC Hardware, HPC Software, Main Feature, News, Parallel Programming, Sponsored Post
Tagged With: blocking, Intel, Intel Xeon Phi, matrix

Fast Matrix Multiply with OpenMP

April 2, 2015 by MichaelS

Many scientific and technical applications rely on matrix multiplies somewhere in the algorithm, and thus in the computer code. With today’s multicore CPUs, proper use of compiler directives can speed up matrix multiplies significantly.

Filed Under: HPC Software, Parallel Programming, Tools
Tagged With: matrix, OpenMP, Weekly Newsletter Articles
