A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment
Aiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing en...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2020/6343705 |
id |
doaj-0730c9ab0d5c4a9e872a33adf2010c3a |
---|---|
record_format |
Article |
spelling |
doaj-0730c9ab0d5c4a9e872a33adf2010c3a2020-11-30T09:11:29ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/63437056343705A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing EnvironmentJin Gao0Jiaquan Liu1Sihua Guo2Qi Zhang3Xinyang Wang4State Grid Chongqing Electric Power Research Institute, Chongqing, ChinaState Grid Chongqing Electric Power Company, Chongqing, ChinaState Grid Chongqing Electric Power Research Institute, Chongqing, ChinaState Grid Chongqing Electric Power Company Maintenance Company, Chongqing, ChinaState Key Laboratory of Power Transmission Equipment and System Security and New Technology, Chongqing University, Chongqing, ChinaAiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing environment is proposed. First of all, the data are preprocessed by a dimensionality reduction model based on deep variational learning and based on ensuring complete data information as much as possible, the dimensionality of the data is reduced, and the computational pressure is reduced. Secondly, the data set stored on the Hadoop Distributed File System (HDFS) is logically divided into several data blocks, and the data blocks are processed in parallel through the principle of MapReduce, so the k-distance and LOF value of each data point can only be calculated in each block. Thirdly, based on stochastic gradient descent, the concept of k-neighboring distance is redefined, thus avoiding the situation where there are greater than or equal to k-repeated points and infinite local density in the data set. Finally, compared with CNN, DeepAnt, and SVM-IDS algorithms, the accuracy of the scheme is increased by 10.3%, 18.0%, and 17.2%, respectively. The experimental data set verifies the effectiveness and scalability of the proposed DMAD-DVDMR algorithm.http://dx.doi.org/10.1155/2020/6343705 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jin Gao Jiaquan Liu Sihua Guo Qi Zhang Xinyang Wang |
spellingShingle |
Jin Gao Jiaquan Liu Sihua Guo Qi Zhang Xinyang Wang A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment Mathematical Problems in Engineering |
author_facet |
Jin Gao Jiaquan Liu Sihua Guo Qi Zhang Xinyang Wang |
author_sort |
Jin Gao |
title |
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment |
title_short |
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment |
title_full |
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment |
title_fullStr |
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment |
title_full_unstemmed |
A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment |
title_sort |
data mining method using deep learning for anomaly detection in cloud computing environment |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2020-01-01 |
description |
Aiming at problems such as slow training speed, poor prediction effect, and unstable detection results of traditional anomaly detection algorithms, a data mining method for anomaly detection based on the deep variational dimensionality reduction model and MapReduce (DMAD-DVDMR) in cloud computing environment is proposed. First of all, the data are preprocessed by a dimensionality reduction model based on deep variational learning and based on ensuring complete data information as much as possible, the dimensionality of the data is reduced, and the computational pressure is reduced. Secondly, the data set stored on the Hadoop Distributed File System (HDFS) is logically divided into several data blocks, and the data blocks are processed in parallel through the principle of MapReduce, so the k-distance and LOF value of each data point can only be calculated in each block. Thirdly, based on stochastic gradient descent, the concept of k-neighboring distance is redefined, thus avoiding the situation where there are greater than or equal to k-repeated points and infinite local density in the data set. Finally, compared with CNN, DeepAnt, and SVM-IDS algorithms, the accuracy of the scheme is increased by 10.3%, 18.0%, and 17.2%, respectively. The experimental data set verifies the effectiveness and scalability of the proposed DMAD-DVDMR algorithm. |
url |
http://dx.doi.org/10.1155/2020/6343705 |
work_keys_str_mv |
AT jingao adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT jiaquanliu adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT sihuaguo adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT qizhang adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT xinyangwang adataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT jingao dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT jiaquanliu dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT sihuaguo dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT qizhang dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment AT xinyangwang dataminingmethodusingdeeplearningforanomalydetectionincloudcomputingenvironment |
_version_ |
1715027760618930176 |