Characterizing the Communication Demands of the Graph500 Benchmark on a Commodity Cluster

Pablo Fuentes; José Luis Bosque; Ramon Beivide; Mateo Valero; Cyriel Minkenberg

doi:10.1109/BDC.2014.16

BDC 2014

Conference paper

05 Nov 2015

Characterizing the Communication Demands of the Graph500 Benchmark on a Commodity Cluster

View publication

Abstract

Big Data applications have become more and more important over the last few years. Such applications are focused on the analysis of huge amounts of unstructured information and present a series of differences with traditional High Performance Computing (HPC) applications. For illustrating such dissimilarities, this paper analyzes the behavior of the most scalable version of the Graph500 benchmark when run on a state-of-the-art commodity cluster facility. Our work shows that this new computation paradigm stresses the interconnection subsystem. In this work, we provide both analytical and empirical characterizations of the Graph500 benchmark, showing that its communication needs bound the achieved performance on a cluster facility. Up to our knowledge, our evaluation is the first to consider the impact of message aggregation on the communication overhead and explore the selection of a trade off that diminishes benchmark execution time, increasing system performance.

Conference paper