Communication Pattern based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment

Soontorn Sirapaisan, Ning Zhang, Qian He

Research output: Contribution to journalArticlepeer-review

167 Downloads (Pure)

Abstract

With the development of cloud computing, there is a growing trend of multi-cloud Collaborative Big Data Computation (CBDC). In this environment, threats from authorized insiders are of particular concerns. Based on an extreme case of distributed computation where multiple collaborators jointly perform CBDC on shared datasets using an example distributed computing framework, MapReduce (MR), deployed in a Multiple Public Cloud (MPC) environment, this paper investigates how to protect the authenticity of data used during the computation in an efficient and scalable manner by proposing and evaluating a novel data authentication solution. The solution, called a Communication Pattern based Data Authentication (CPDA) framework, ensures data authenticity and non-repudiation of origin at the finest granularity without compromising efficiency and scalability. This is achieved by using an idea of communication pattern based authentication data aggregation. The framework has been comprehensively evaluated both theoretically and experimentally. The evaluation results show that the CPDA framework offers the strongest level of data authenticity protection (equivalent to that provided by digitally signing each data object individually) but introduces much lower overhead cost than the digital signature based solution. The results demonstrate that the idea of communication pattern based authentication data aggregation brings much benefit in terms of supporting efficient and scalable data authentication in a large-scale distributed system.
Original languageEnglish
Article number9112146
Pages (from-to)107716-107748
Number of pages33
JournalIEEE Access
Volume8
DOIs
Publication statusPublished - 9 Jun 2020

Keywords

  • Big data
  • MapReduce
  • cloud
  • data authentication
  • distributed computing

Fingerprint

Dive into the research topics of 'Communication Pattern based Data Authentication (CPDA) Designed for Big Data Processing in a Multiple Public Cloud Environment'. Together they form a unique fingerprint.

Cite this