An anonymization technique using intersected decision trees
Data mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2015-07-01
|
Series: | Journal of King Saud University: Computer and Information Sciences |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1319157815000452 |
id |
doaj-6b43bdd30f994d01b8c862a694fb18a6 |
---|---|
record_format |
Article |
spelling |
doaj-6b43bdd30f994d01b8c862a694fb18a62020-11-25T01:09:34ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782015-07-0127329730410.1016/j.jksuci.2014.06.015An anonymization technique using intersected decision treesSam FletcherMd Zahidul IslamData mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides individual privacy while allowing useful data mining. We present a novel noise addition technique called Forest Framework, two novel data quality evaluation techniques called EDUDS and EDUSC, and a security evaluation technique called SERS. Forest Framework builds a decision forest from a dataset and preserves all the patterns (logic rules) of the forest while adding noise to the dataset. We compare Forest Framework to its predecessor, Framework, and another established technique, GADP. Our comparison is done using our three evaluation criteria, as well as Prediction Accuracy. Our experimental results demonstrate the success of our proposed extensions to Framework and the usefulness of our evaluation criteria.http://www.sciencedirect.com/science/article/pii/S1319157815000452Privacy preserving data miningDecision treeAnonymizationData miningData quality |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sam Fletcher Md Zahidul Islam |
spellingShingle |
Sam Fletcher Md Zahidul Islam An anonymization technique using intersected decision trees Journal of King Saud University: Computer and Information Sciences Privacy preserving data mining Decision tree Anonymization Data mining Data quality |
author_facet |
Sam Fletcher Md Zahidul Islam |
author_sort |
Sam Fletcher |
title |
An anonymization technique using intersected decision trees |
title_short |
An anonymization technique using intersected decision trees |
title_full |
An anonymization technique using intersected decision trees |
title_fullStr |
An anonymization technique using intersected decision trees |
title_full_unstemmed |
An anonymization technique using intersected decision trees |
title_sort |
anonymization technique using intersected decision trees |
publisher |
Elsevier |
series |
Journal of King Saud University: Computer and Information Sciences |
issn |
1319-1578 |
publishDate |
2015-07-01 |
description |
Data mining plays an important role in analyzing the massive amount of data collected in today’s world. However, due to the public’s rising awareness of privacy and lack of trust in organizations, suitable Privacy Preserving Data Mining (PPDM) techniques have become vital. A PPDM technique provides individual privacy while allowing useful data mining. We present a novel noise addition technique called Forest Framework, two novel data quality evaluation techniques called EDUDS and EDUSC, and a security evaluation technique called SERS. Forest Framework builds a decision forest from a dataset and preserves all the patterns (logic rules) of the forest while adding noise to the dataset. We compare Forest Framework to its predecessor, Framework, and another established technique, GADP. Our comparison is done using our three evaluation criteria, as well as Prediction Accuracy. Our experimental results demonstrate the success of our proposed extensions to Framework and the usefulness of our evaluation criteria. |
topic |
Privacy preserving data mining Decision tree Anonymization Data mining Data quality |
url |
http://www.sciencedirect.com/science/article/pii/S1319157815000452 |
work_keys_str_mv |
AT samfletcher ananonymizationtechniqueusingintersecteddecisiontrees AT mdzahidulislam ananonymizationtechniqueusingintersecteddecisiontrees AT samfletcher anonymizationtechniqueusingintersecteddecisiontrees AT mdzahidulislam anonymizationtechniqueusingintersecteddecisiontrees |
_version_ |
1725178026288742400 |