Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification

Cloud computing plays an essential role as a source for outsourcing data to perform mining operations or other data processing, especially for data owners who do not have sufficient resources or experience to execute data mining techniques. However, the privacy of outsourced data is a serious concer...

Full description

Bibliographic Details
Main Authors: Huda O. Mansour, Maheyzah M. Siraj, Fuad A. Ghaleb, Faisal Saeed, Eman H. Alkhammash, Mohd A. Maarof
Format: Article
Language:English
Published: Hindawi-Wiley 2021-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2021/7154705
id doaj-73b5087c45e14a68b9b500229f337d17
record_format Article
spelling doaj-73b5087c45e14a68b9b500229f337d172021-09-06T00:00:44ZengHindawi-WileyWireless Communications and Mobile Computing1530-86772021-01-01202110.1155/2021/7154705Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk ReidentificationHuda O. Mansour0Maheyzah M. Siraj1Fuad A. Ghaleb2Faisal Saeed3Eman H. Alkhammash4Mohd A. Maarof5Faculty of EngineeringDepartment of Computer ScienceFaculty of EngineeringCollege of Computer Science and EngineeringDepartment of Computer ScienceFaculty of EngineeringCloud computing plays an essential role as a source for outsourcing data to perform mining operations or other data processing, especially for data owners who do not have sufficient resources or experience to execute data mining techniques. However, the privacy of outsourced data is a serious concern. Most data owners are using anonymization-based techniques to prevent identity and attribute disclosures to avoid privacy leakage before outsourced data for mining over the cloud. In addition, data collection and dissemination in a resource-limited network such as sensor cloud require efficient methods to reduce privacy leakage. The main issue that caused identity disclosure is quasi-identifier (QID) linking. But most researchers of anonymization methods ignore the identification of proper QIDs. This reduces the validity of the used anonymization methods and may thus lead to a failure of the anonymity process. This paper introduces a new quasi-identifier recognition algorithm that reduces identity disclosure which resulted from QID linking. The proposed algorithm is comprised of two main stages: (1) attribute classification (or QID recognition) and (2) QID dimension identification. The algorithm works based on the reidentification of risk rate for all attributes and the dimension of QIDs where it determines the proper QIDs and their suitable dimensions. The proposed algorithm was tested on a real dataset. The results demonstrated that the proposed algorithm significantly reduces privacy leakage and maintains the data utility compared to recent related algorithms.http://dx.doi.org/10.1155/2021/7154705
collection DOAJ
language English
format Article
sources DOAJ
author Huda O. Mansour
Maheyzah M. Siraj
Fuad A. Ghaleb
Faisal Saeed
Eman H. Alkhammash
Mohd A. Maarof
spellingShingle Huda O. Mansour
Maheyzah M. Siraj
Fuad A. Ghaleb
Faisal Saeed
Eman H. Alkhammash
Mohd A. Maarof
Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
Wireless Communications and Mobile Computing
author_facet Huda O. Mansour
Maheyzah M. Siraj
Fuad A. Ghaleb
Faisal Saeed
Eman H. Alkhammash
Mohd A. Maarof
author_sort Huda O. Mansour
title Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
title_short Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
title_full Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
title_fullStr Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
title_full_unstemmed Quasi-Identifier Recognition Algorithm for Privacy Preservation of Cloud Data Based on Risk Reidentification
title_sort quasi-identifier recognition algorithm for privacy preservation of cloud data based on risk reidentification
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8677
publishDate 2021-01-01
description Cloud computing plays an essential role as a source for outsourcing data to perform mining operations or other data processing, especially for data owners who do not have sufficient resources or experience to execute data mining techniques. However, the privacy of outsourced data is a serious concern. Most data owners are using anonymization-based techniques to prevent identity and attribute disclosures to avoid privacy leakage before outsourced data for mining over the cloud. In addition, data collection and dissemination in a resource-limited network such as sensor cloud require efficient methods to reduce privacy leakage. The main issue that caused identity disclosure is quasi-identifier (QID) linking. But most researchers of anonymization methods ignore the identification of proper QIDs. This reduces the validity of the used anonymization methods and may thus lead to a failure of the anonymity process. This paper introduces a new quasi-identifier recognition algorithm that reduces identity disclosure which resulted from QID linking. The proposed algorithm is comprised of two main stages: (1) attribute classification (or QID recognition) and (2) QID dimension identification. The algorithm works based on the reidentification of risk rate for all attributes and the dimension of QIDs where it determines the proper QIDs and their suitable dimensions. The proposed algorithm was tested on a real dataset. The results demonstrated that the proposed algorithm significantly reduces privacy leakage and maintains the data utility compared to recent related algorithms.
url http://dx.doi.org/10.1155/2021/7154705
work_keys_str_mv AT hudaomansour quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
AT maheyzahmsiraj quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
AT fuadaghaleb quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
AT faisalsaeed quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
AT emanhalkhammash quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
AT mohdamaarof quasiidentifierrecognitionalgorithmforprivacypreservationofclouddatabasedonriskreidentification
_version_ 1717780271050260480