Home // ICWMC 2019, The Fifteenth International Conference on Wireless and Mobile Communications // View article


Variable Distinct L-diversity Algorithm Applied on Highly Sensitive Correlated Attributes

Authors:
Zakariae El Ouazzani
Hanan El Bakkali

Keywords: big data; anonymization; L-diversity technique; non−numerical attributes; correlation; Pearson.

Abstract:
In this information age, large amount of data is available online. These data are used by both internal and external sources for analysis and research purposes. The collected data is stored into huge data sets containing sensitive and Non−Sensitive Attributes. For the reason that attributes are gen- erally separated, the correlation between these various attributes is lost. Thus, it will be necessary to prevent attributes from losing the correlation between them or at least reduce the correlation loss. As a solution, correlated attributes are grouped together. Although, the data utility is preserved by reducing the correlation loss between Sensitive Attributes, privacy protection remains a serious concern. The main problem here is publishing data sets without revealing the sensitive information of individuals and in the same time preserving data utility. Most of the current researches on ensuring privacy in big data are centered on data anonymization. L-diversity is an anonymization technique that can be applied on a data set with one or multiple Sensitive Attributes. This paper proposes an algorithm that deals with sensitive numerical and non−numerical attributes. The algorithm applies the principle of L-diversity technique after grouping highly correlated attributes together through a vertical partitioning. Our proposed algorithm makes a balance between privacy and data utility.

Pages: 47 to 52

Copyright: Copyright (c) IARIA, 2019

Publication date: June 30, 2019

Published in: conference

ISSN: 2308-4219

ISBN: 978-1-61208-719-1

Location: Rome, Italy

Dates: from June 30, 2019 to July 4, 2019