Mining Effective Subsequences with Application in Marketing Attribution
Abstract
In this paper, we present a new data mining framework for discovering sequence effects. In particular, we focus on the sequences consisting of actions that are taken in chronological order, like sequences of clinical procedures or marketing actions. Each sequence is associated with a binary outcome, a success or a failure. We investigate the hypothesis that certain subsequences of actions contribute to successes, which we call effective subsequences. A generic data mining algorithm for extracting effective subsequences is proposed, which is verified both quantitatively and qualitatively. We experimented our effective subsequence mining algorithm on a real sales opportunity dataset. Based on the subsequence model, a market campaign attribution model is proposed, with application to the same sales dataset.