Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment

With the development of cloud computing, there is a growing trend towards multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concern. Based on an extreme case of distributed computation where multiple collaborators jointly perfo...

Bibliographic Details
Main Authors: Soontorn Sirapaisan, Ning Zhang, Qian He
Format: Article
Language: English
Published: IEEE 2020-01-01
Series: IEEE Access
Subjects: Big data; cloud; data authentication; distributed computing; MapReduce
Online Access: https://ieeexplore.ieee.org/document/9112146/
id doaj-1a07851ad80347d59a6d8555b36d12cb
record_format Article
spelling doaj-1a07851ad80347d59a6d8555b36d12cb
2021-03-30T02:58:02Z
eng
IEEE
IEEE Access
2169-3536
2020-01-01
Volume 8, pages 107716-107748
DOI: 10.1109/ACCESS.2020.3000989
Article number: 9112146
Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment
Soontorn Sirapaisan (https://orcid.org/0000-0002-5411-4471), Department of Computer Science, The University of Manchester, Manchester, U.K.
Ning Zhang (https://orcid.org/0000-0001-9519-9128), Department of Computer Science, The University of Manchester, Manchester, U.K.
Qian He (https://orcid.org/0000-0003-3020-2896), School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
https://ieeexplore.ieee.org/document/9112146/
Big data
cloud
data authentication
distributed computing
MapReduce
collection DOAJ
language English
format Article
sources DOAJ
author Soontorn Sirapaisan
Ning Zhang
Qian He
title Communication Pattern Based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description With the development of cloud computing, there is a growing trend towards multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concern. Based on an extreme case of distributed computation, in which multiple collaborators jointly perform CBDC on shared datasets using an example distributed computing framework, MapReduce (MR), deployed in a Multiple Public Cloud (MPC) environment, this paper investigates how to protect the authenticity of data used during the computation in an efficient and scalable manner by proposing and evaluating a novel data authentication solution. The solution, called the Communication Pattern based Data Authentication (CPDA) framework, ensures data authenticity and non-repudiation of origin at the finest granularity without compromising efficiency and scalability. This is achieved by using the idea of communication pattern based authentication data aggregation. The framework has been comprehensively evaluated both theoretically and experimentally. The evaluation results show that the CPDA framework offers the strongest level of data authenticity protection (equivalent to that provided by digitally signing each data object individually) but incurs a much lower overhead cost than the digital signature based solution. The results demonstrate that communication pattern based authentication data aggregation is highly beneficial for supporting efficient and scalable data authentication in a large-scale distributed system.
topic Big data
cloud
data authentication
distributed computing
MapReduce
url https://ieeexplore.ieee.org/document/9112146/