Mining echocardiography workflows for disease discriminative patterns.
Abstract
We aim to provide medical practitioners with quick diagnostic insights into echocardiograms by analyzing only the echocardiogram workflows (defined as the sequence of modalities examined). We define a dictionary of subflows: workflow fragments that are commonly encountered in echocardiography workflows but are mutually exclusive. We represent each workflow as a mixture of dictionary subflows and learn discriminative models for various cardiac diseases using Support Vector Machines. Using these discriminative models, we can predict the occurrence of diseases for any previously unseen echocardiogram workflow. Working with a corpus of 2300 echocardiogram workflows, we build a dictionary of 172 subflows. Using the associated expert-created reports, we identify the ground-truth diagnoses. We then build discriminative models for 7 different cardiac diseases. Using just the workflow as input, these models predict diseases with an average accuracy of over 75%. By mining a collection of echocardiography workflows, we are able, for the first time, to predict diseases without even looking at the image contents.
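As an illustration of the representation step, the following minimal sketch turns one workflow into a mixture vector over a dictionary of subflows. The abstract does not specify how workflows are decomposed, so the greedy longest-match covering used here is an assumption, and the modality names, the four-entry dictionary, and the helper names `decompose` and `mixture_vector` are hypothetical. In the approach described above, a vector of this kind would be the input to a per-disease Support Vector Machine.

```python
from collections import Counter

# Hypothetical dictionary of subflows (the actual dictionary has 172 entries).
# Each subflow is a short sequence of modalities; names are illustrative only.
SUBFLOWS = [
    ("2D", "M-mode"),
    ("Color Doppler", "CW Doppler"),
    ("2D",),
    ("PW Doppler",),
]


def decompose(workflow, subflows):
    """Cover a workflow with non-overlapping subflows, longest match first.

    This greedy covering is an assumption made for the sketch, not the
    decomposition used in the original work.
    """
    ordered = sorted(subflows, key=len, reverse=True)
    matches = []
    i = 0
    while i < len(workflow):
        for sub in ordered:
            if tuple(workflow[i:i + len(sub)]) == sub:
                matches.append(sub)
                i += len(sub)
                break
        else:
            # No dictionary subflow starts here; skip this modality.
            i += 1
    return matches


def mixture_vector(workflow, subflows):
    """Return the fraction of matched subflows of each dictionary type."""
    counts = Counter(decompose(workflow, subflows))
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(subflows)
    return [counts[sub] / total for sub in subflows]


workflow = ["2D", "M-mode", "2D", "Color Doppler", "CW Doppler", "PW Doppler"]
vec = mixture_vector(workflow, SUBFLOWS)
print(vec)
```

Each workflow, whatever its length, maps to a fixed-length vector with one entry per dictionary subflow, which is the form a standard classifier expects.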