A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment

Aiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing en...

Full description

Bibliographic Details
Main Authors: Jin Gao, Jiaquan Liu, Sihua Guo, Qi Zhang, Xinyang Wang
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2020/6343705
id doaj-0730c9ab0d5c4a9e872a33adf2010c3a
record_format Article
spelling doaj-0730c9ab0d5c4a9e872a33adf2010c3a2020-11-30T09:11:29ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/63437056343705A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing EnvironmentJin Gao0Jiaquan Liu1Sihua Guo2Qi Zhang3Xinyang Wang4State Grid Chongqing Electric Power Research Institute, Chongqing, ChinaState Grid Chongqing Electric Power Company, Chongqing, ChinaState Grid Chongqing Electric Power Research Institute, Chongqing, ChinaState Grid Chongqing Electric Power Company Maintenance Company, Chongqing, ChinaState Key Laboratory of Power Transmission Equipment and System Security and New Technology, Chongqing University, Chongqing, ChinaAiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing environment is proposed. First of all, the data are preprocessed by a dimensionality reduction model based on deep variational learning and based on ensuring complete data information as much as possible, the dimensionality of the data is reduced, and the computational pressure is reduced. Secondly, the data set stored on the Hadoop Distributed File System (HDFS) is logically divided into several data blocks, and the data blocks are processed in parallel through the principle of MapReduce, so the k-distance and LOF value of each data point can only be calculated in each block. Thirdly, based on stochastic gradient descent, the concept of k-neighboring distance is redefined, thus avoiding the situation where there are greater than or equal to k-repeated points and infinite local density in the data set. Finally, compared with CNN, DeepAnt, and SVM-IDS algorithms, the accuracy of the scheme is increased by 10.3%, 18.0%, and 17.2%, respectively. The experimental data set verifies the effectiveness and scalability of the proposed DMAD-DVDMR algorithm.http://dx.doi.org/10.1155/2020/6343705
collection DOAJ
language English
format Article
sources DOAJ
author Jin Gao
Jiaquan Liu
Sihua Guo
Qi Zhang
Xinyang Wang
spellingShingle Jin Gao
Jiaquan Liu
Sihua Guo
Qi Zhang
Xinyang Wang
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
Mathematical Problems in Engineering
author_facet Jin Gao
Jiaquan Liu
Sihua Guo
Qi Zhang
Xinyang Wang
author_sort Jin Gao
title A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
title_short A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
title_full A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
title_fullStr A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
title_full_unstemmed A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
title_sort data mining method using deep learning for anomaly detection in cloud computing environment
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2020-01-01
description Aiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing environment is proposed. First of all, the data are preprocessed by a dimensionality reduction model based on deep variational learning and based on ensuring complete data information as much as possible, the dimensionality of the data is reduced, and the computational pressure is reduced. Secondly, the data set stored on the Hadoop Distributed File System (HDFS) is logically divided into several data blocks, and the data blocks are processed in parallel through the principle of MapReduce, so the k-distance and LOF value of each data point can only be calculated in each block. Thirdly, based on stochastic gradient descent, the concept of k-neighboring distance is redefined, thus avoiding the situation where there are greater than or equal to k-repeated points and infinite local density in the data set. Finally, compared with CNN, DeepAnt, and SVM-IDS algorithms, the accuracy of the scheme is increased by 10.3%, 18.0%, and 17.2%, respectively. The experimental data set verifies the effectiveness and scalability of the proposed DMAD-DVDMR algorithm.
url http://dx.doi.org/10.1155/2020/6343705
work_keys_str_mv AT jingao adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT jiaquanliu adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT sihuaguo adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT qizhang adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT xinyangwang adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT jingao dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT jiaquanliu dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT sihuaguo dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT qizhang dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
AT xinyangwang dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment
_version_ 1715027760618930176