About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICPP 1993
Conference paper
Online algorithms for handling skew in parallel joins
Abstract
When the work involved in a join is partitioned among multiple processors in the parallel join, the skew in the operand relations can result in significant imbalance in the work assigned to the different processors. This imbalance can cause significant degradation in the response time lor the join operation. In this paper, we present two new algorithms, the pure online PO and the online/offline algorithm OO, for handling skew in parallel join evaluation. Lsing the histogram data of the operand relations, the skew handling algorithms can achieve any permissible tolerance in the join work imbalance among the processors. Unlike previous work on handling skew in parallel joins, the algorithms are online in the sense that they process most (or all) of the histogram data as it is generated. We compare these algorithms and their variations experimentally and find that a particular variation of algorithm OO is usually the algorithm of choice.