Carnegie Mellon University
Browse

Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures

Download (369.81 kB)
journal contribution
posted on 2011-03-01, 00:00 authored by Tom Henretty, Kevin Stock, Louis-Noel Pouchet, Franz Franchetti, J. Ramanujam, P. Sadayappan
<p>Stencil computations are at the core of applications in many domains such as computational electromagnetics, image processing, and partial differential equation solvers used in a variety of scientific and engineering applications. Short-vector SIMD instruction sets such as SSE and VMX provide a promising and widely available avenue for enhancing performance on modern processors. However a fundamental memory stream alignment issue limits achieved performance with stencil computations on modern short SIMD architectures. In this paper, we propose a novel data layout transformation that avoids the stream alignment conflict, along with a static analysis technique for determining where this transformation is applicable. Significant performance increases are demonstrated for a variety of stencil codes on three modern SIMD-capable processors.</p>

History

Related Materials

Publisher Statement

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-19861-8_13

Date

2011-03-01