About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
IPDPSW 2016
Conference paper
Big data for medical image analysis: A performance study
Abstract
Big data systems can be used to facilitate powerfulmedical image analysis at scale. Understanding their behaviorsin this context can lead to many benefits, ranging from superiorinfrastructure configurations to optimized parallel algorithmimplementations. This paper is, to our knowledge, a first steptowards developing such an understanding for state-of-the-artbig data platforms. We characterize a representative medicalimage segmentation pipeline, detailing the per-stage CPU, memory, I/O reads and writes, and execution time patterns. Thischaracterization has already helped us overcome a bottleneckpersistently causing analysis to crash unexpectedly, and avoidpoor architecture choices on storage and parallel execution.