About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
HPCC/SmartCity/DSS 2016
Conference paper
Composable Locality Optimizations for Accelerating Parallel Forest Computations
Abstract
Graph algorithms with large, sparse inputs oftentimes perform poorly on cache-based platforms. For parallel spanning forest and minimum spanning forest computations, we propose three locality optimizations, Update, Stages, and PSM, that improve the cache performance. Update exploits the 'graft-and-shortcut' pattern of the base algorithms. PSM transforms irregular accesses into regular ones. Stages is a meta approach based on evolution random graph theory that we use to combine Update and Stages. On the target platform, our combined optimization achieves up to 19 times speedups over the base algorithms.