Efficient algorithms for identifying privacy vulnerabilities

Aris Gkoulalas-Divanis; Stefano Braghin

doi:10.1109/ISC2.2015.7366170

ISC2 2015

Conference paper

24 Dec 2015

Efficient algorithms for identifying privacy vulnerabilities

View publication

Abstract

The automatic identification of privacy vulnerabilities in datasets is an important step in the privacy-preserving data publishing process, and an area of increased interest for commercial data masking products. In this paper, we propose two multi-threaded algorithms for discovering privacy vulnerabilities in datasets, in the form of combinations of attributes leading to few records. Our algorithms fully utilize the execution environment and outperform the state-of-the-art to the extent that we had to design a multi-threaded counterpart of the state-of-the-art method to form the baseline for our experiments. Through experimental evaluation on a large set of datasets, we show that our algorithms can analyze microdata consisting of millions of records in less than 10 minutes, when the baseline method required more than 3 hours.

Paper