About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
SHPCC 1994
Conference paper
Efficient parallel algorithm for the 3-D FFT NAS parallel benchmark
Abstract
In this paper we propose an efficient algorithm to implement the 3-D NAS FFT benchmark. The proposed algorithm overlaps the communication with the computation. On parallel machines supporting overlap of communication with computation, our proposed algorithm can outperform the non-overlapping version of this algorithm by a factor close to two.