Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment
With the development of cloud computing, there is a growing trend of multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concerns. Based on an extreme case of distributed computation where multiple collaborators jointly perfo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9112146/ |
id |
doaj-1a07851ad80347d59a6d8555b36d12cb |
---|---|
record_format |
Article |
spelling |
doaj-1a07851ad80347d59a6d8555b36d12cb2021-03-30T02:58:02ZengIEEEIEEE Access2169-35362020-01-01810771610774810.1109/ACCESS.2020.30009899112146Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud EnvironmentSoontorn Sirapaisan0https://orcid.org/0000-0002-5411-4471Ning Zhang1https://orcid.org/0000-0001-9519-9128Qian He2https://orcid.org/0000-0003-3020-2896Department of Computer Science, The University of Manchester, Manchester, U.K.Department of Computer Science, The University of Manchester, Manchester, U.K.School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, ChinaWith the development of cloud computing, there is a growing trend of multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concerns. Based on an extreme case of distributed computation where multiple collaborators jointly perform CBDC on shared datasets using an example distributed computing framework, MapReduce (MR), deployed in a Multiple Public Cloud (MPC) environment, this paper investigates how to protect the authenticity of data used during the computation in an efficient and scalable manner by proposing and evaluating a novel data authentication solution. The solution, called a Communication Pattern based Data Authentication (CPDA) framework, ensures data authenticity and non-repudiation of origin at the finest granularity without compromising efficiency and scalability. This is achieved by using an idea of communication pattern based authentication data aggregation. The framework has been comprehensively evaluated both theoretically and experimentally. The evaluation results show that the CPDA framework offers the strongest level of data authenticity protection (equivalent to that provided by digitally signing each data object individually) but introduces much lower overhead cost than the digital signature based solution. The results demonstrate that the idea of communication pattern based authentication data aggregation brings much benefit in terms of supporting efficient and scalable data authentication in a large-scale distributed system.https://ieeexplore.ieee.org/document/9112146/Big dataclouddata authenticationdistributed computingMapReduce |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Soontorn Sirapaisan Ning Zhang Qian He |
spellingShingle |
Soontorn Sirapaisan Ning Zhang Qian He Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment IEEE Access Big data cloud data authentication distributed computing MapReduce |
author_facet |
Soontorn Sirapaisan Ning Zhang Qian He |
author_sort |
Soontorn Sirapaisan |
title |
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment |
title_short |
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment |
title_full |
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment |
title_fullStr |
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment |
title_full_unstemmed |
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment |
title_sort |
communication pattern based data authentication (cpda) designed for big data processing in a multiple public cloud environment |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
With the development of cloud computing, there is a growing trend of multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concerns. Based on an extreme case of distributed computation where multiple collaborators jointly perform CBDC on shared datasets using an example distributed computing framework, MapReduce (MR), deployed in a Multiple Public Cloud (MPC) environment, this paper investigates how to protect the authenticity of data used during the computation in an efficient and scalable manner by proposing and evaluating a novel data authentication solution. The solution, called a Communication Pattern based Data Authentication (CPDA) framework, ensures data authenticity and non-repudiation of origin at the finest granularity without compromising efficiency and scalability. This is achieved by using an idea of communication pattern based authentication data aggregation. The framework has been comprehensively evaluated both theoretically and experimentally. The evaluation results show that the CPDA framework offers the strongest level of data authenticity protection (equivalent to that provided by digitally signing each data object individually) but introduces much lower overhead cost than the digital signature based solution. The results demonstrate that the idea of communication pattern based authentication data aggregation brings much benefit in terms of supporting efficient and scalable data authentication in a large-scale distributed system. |
topic |
Big data cloud data authentication distributed computing MapReduce |
url |
https://ieeexplore.ieee.org/document/9112146/ |
work_keys_str_mv |
AT soontornsirapaisan communicationpatternbaseddataauthenticationcpdadesignedforbigdataprocessinginamultiplepubliccloudenvironment AT ningzhang communicationpatternbaseddataauthenticationcpdadesignedforbigdataprocessinginamultiplepubliccloudenvironment AT qianhe communicationpatternbaseddataauthenticationcpdadesignedforbigdataprocessinginamultiplepubliccloudenvironment |
_version_ |
1724184269951074304 |