DynMR: Dynamic mapreduce with reducetask interleaving and maptask backfilling

Jian Tan; Alicia Chiny; Zane Zhenhua Hu; Yonggang Huy; Shicong Meng; Xiaoqiao Meng; Li Zhang

doi:10.1145/2592798.2592805

EuroSys 2014

Conference paper

14 Apr 2014

DynMR: Dynamic mapreduce with reducetask interleaving and maptask backfilling

View publication

Abstract

In order to improve the performance of MapReduce, we design DynMR. It addresses the following problems that persist in the existing implementations: 1) difficulty in selecting optimal performance parameters for a single job in a fixed, dedicated environment, and lack of capability to configure parameters that can perform optimally in a dynamic, multijob cluster; 2) long job execution resulting from a task longtail effect, often caused by ReduceTask data skew or heterogeneous computing nodes; 3) inefficient use of hardware resources, since ReduceTasks bundle several functional phases together and may idle during certain phases. DynMR adaptively interleaves the execution of several partially-completed ReduceTasks and backfills MapTasks so that they run in the same JVM, one at a time. It consists of three components. 1) A running ReduceTask uses a detection algorithm to identify resource underutilization during the shuffle phase. It then gives up the allocated hardware resources efficiently to the next task. 2) A number of ReduceTasks are gradually assembled in a progressive queue, according to a flow control algorithm in runtime. These tasks execute in an interleaved rotation. Additional ReduceTasks can be inserted adaptively to the progressive queue if the full fetching capacity is not reached. MapTasks can be backfilled therein if it is still underused. 3) Merge threads of each ReduceTask are extracted out as standalone services within the associated JVM. This design allows the data segments of multiple partially-complete ReduceTasks to reside in the same JVM heap, controlled by a segment manager and served by the common merge threads. Experiments show 10% ∼ 40% improvements, depending on the workload. Copyright © 2007 by the Association for Computing Machinery, Inc.

Paper