An anonymization technique using intersected decision trees

Data mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides...

Full description

Bibliographic Details
Main Authors: Sam Fletcher, Md Zahidul Islam
Format: Article
Language:English
Published: Elsevier 2015-07-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157815000452
id doaj-6b43bdd30f994d01b8c862a694fb18a6
record_format Article
spelling doaj-6b43bdd30f994d01b8c862a694fb18a62020-11-25T01:09:34ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782015-07-0127329730410.1016/j.jksuci.2014.06.015An anonymization technique using intersected decision treesSam FletcherMd Zahidul IslamData mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides individual privacy while allowing useful data mining. We present a novel noise addition technique called Forest Framework, two novel data quality evaluation techniques called EDUDS and EDUSC, and a security evaluation technique called SERS. Forest Framework builds a decision forest from a dataset and preserves all the patterns (logic rules) of the forest while adding noise to the dataset. We compare Forest Framework to its predecessor, Framework, and another established technique, GADP. Our comparison is done using our three evaluation criteria, as well as Prediction Accuracy. Our experimental results demonstrate the success of our proposed extensions to Framework and the usefulness of our evaluation criteria.http://www.sciencedirect.com/science/article/pii/S1319157815000452Privacy preserving data miningDecision treeAnonymizationData miningData quality
collection DOAJ
language English
format Article
sources DOAJ
author Sam Fletcher
Md Zahidul Islam
spellingShingle Sam Fletcher
Md Zahidul Islam
An anonymization technique using intersected decision trees
Journal of King Saud University: Computer and Information Sciences
Privacy preserving data mining
Decision tree
Anonymization
Data mining
Data quality
author_facet Sam Fletcher
Md Zahidul Islam
author_sort Sam Fletcher
title An anonymization technique using intersected decision trees
title_short An anonymization technique using intersected decision trees
title_full An anonymization technique using intersected decision trees
title_fullStr An anonymization technique using intersected decision trees
title_full_unstemmed An anonymization technique using intersected decision trees
title_sort anonymization technique using intersected decision trees
publisher Elsevier
series Journal of King Saud University: Computer and Information Sciences
issn 1319-1578
publishDate 2015-07-01
description Data mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides individual privacy while allowing useful data mining. We present a novel noise addition technique called Forest Framework, two novel data quality evaluation techniques called EDUDS and EDUSC, and a security evaluation technique called SERS. Forest Framework builds a decision forest from a dataset and preserves all the patterns (logic rules) of the forest while adding noise to the dataset. We compare Forest Framework to its predecessor, Framework, and another established technique, GADP. Our comparison is done using our three evaluation criteria, as well as Prediction Accuracy. Our experimental results demonstrate the success of our proposed extensions to Framework and the usefulness of our evaluation criteria.
topic Privacy preserving data mining
Decision tree
Anonymization
Data mining
Data quality
url http://www.sciencedirect.com/science/article/pii/S1319157815000452
work_keys_str_mv AT samfletcher ananonymizationtechniqueusingintersecteddecisiontrees
AT mdzahidulislam ananonymizationtechniqueusingintersecteddecisiontrees
AT samfletcher anonymizationtechniqueusingintersecteddecisiontrees
AT mdzahidulislam anonymizationtechniqueusingintersecteddecisiontrees
_version_ 1725178026288742400