Language level checkpointing support for stream processing applications

Gabriela Jacques-Silva; Buǧra Gedik; Henrique Andrade; Kun-Lung Wu

doi:10.1109/DSN.2009.5270344

DSN 2009

Conference paper

25 Nov 2009

Language level checkpointing support for stream processing applications

View publication

Abstract

Many streaming applications demand continuous processing of live data with little or no downtime, therefore, making high-availability a crucial operational requirement. Fault tolerance techniques are generally expensive and when directly applied to streaming systems with stringent throughput and latency requirements, they might incur a prohibitive performance overhead. This paper describes a flexible, light-weight fault tolerance solution in the context of the SPADE language and the System S distributed stream processing engine. We devised language extensions so users can define and parameterize checkpoint policies easily. This configurable fault tolerance solution is implemented through code generation in SPADE, which reduces the overall application fault tolerance costs by incurring them only for the parts of the application that require it. In this paper we focus on the overall design of our checkpoint mechanism and we also describe an incremental checkpointing algorithm that is suitable for on-the-fly processing of high-rate data streams. ©2009 IEEE.

Conference paper