Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network

With the continuous development of Web attacks, many web applications have been suffering from various forms of security threats and network attacks. The security detection of URLs has always been the focus of Web security. Many web application resources can be accessed by simply entering an URL or...

Full description

Bibliographic Details
Main Authors: Wenchuan Yang, Wen Zuo, Baojiang Cui
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8629082/
id doaj-60e1a35caa7e4f0b8a624eccb9476fe5
record_format Article
spelling doaj-60e1a35caa7e4f0b8a624eccb9476fe52021-03-29T22:18:54ZengIEEEIEEE Access2169-35362019-01-017298912990010.1109/ACCESS.2019.28957518629082Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural NetworkWenchuan Yang0Wen Zuo1https://orcid.org/0000-0002-7137-8014Baojiang Cui2School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing, ChinaWith the continuous development of Web attacks, many web applications have been suffering from various forms of security threats and network attacks. The security detection of URLs has always been the focus of Web security. Many web application resources can be accessed by simply entering an URL or clicking a link in the browser. An attacker can construct various web attacks such as SQL, XSS, and information disclosure by embedding executable code or injecting malicious code into the URL. Therefore, it is necessary to improve the reliability and security of web applications by accurately detecting malicious URLs. This paper designs a convolutional gated-recurrent-unit (GRU) neural network for the detection of malicious URLs detection based on characters as text classification features. Considering that malicious keywords are unique to URLs, a feature representation method of URLs based on malicious keywords is proposed, and a GRU is used in place of the original pooling layer to perform feature acquisition on the time dimension, resulting in high-accuracy multicategory results. The experimental results show that our proposed neural network detection model is very suitable for high-precision classification tasks. Compared with other classification models, the model accuracy rate is above 99.6%. The use of deep learning to classify URLs to identify Web visitors' intentions has important theoretical and scientific values for Web security research, providing new ideas for intelligent security detection.https://ieeexplore.ieee.org/document/8629082/Gated recurrent unit (GRU)malicious URL detectionnetwork attackcharacter-level embeddingconvolutional neural networkneural network model
collection DOAJ
language English
format Article
sources DOAJ
author Wenchuan Yang
Wen Zuo
Baojiang Cui
spellingShingle Wenchuan Yang
Wen Zuo
Baojiang Cui
Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
IEEE Access
Gated recurrent unit (GRU)
malicious URL detection
network attack
character-level embedding
convolutional neural network
neural network model
author_facet Wenchuan Yang
Wen Zuo
Baojiang Cui
author_sort Wenchuan Yang
title Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
title_short Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
title_full Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
title_fullStr Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
title_full_unstemmed Detecting Malicious URLs via a Keyword-Based Convolutional Gated-Recurrent-Unit Neural Network
title_sort detecting malicious urls via a keyword-based convolutional gated-recurrent-unit neural network
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description With the continuous development of Web attacks, many web applications have been suffering from various forms of security threats and network attacks. The security detection of URLs has always been the focus of Web security. Many web application resources can be accessed by simply entering an URL or clicking a link in the browser. An attacker can construct various web attacks such as SQL, XSS, and information disclosure by embedding executable code or injecting malicious code into the URL. Therefore, it is necessary to improve the reliability and security of web applications by accurately detecting malicious URLs. This paper designs a convolutional gated-recurrent-unit (GRU) neural network for the detection of malicious URLs detection based on characters as text classification features. Considering that malicious keywords are unique to URLs, a feature representation method of URLs based on malicious keywords is proposed, and a GRU is used in place of the original pooling layer to perform feature acquisition on the time dimension, resulting in high-accuracy multicategory results. The experimental results show that our proposed neural network detection model is very suitable for high-precision classification tasks. Compared with other classification models, the model accuracy rate is above 99.6%. The use of deep learning to classify URLs to identify Web visitors' intentions has important theoretical and scientific values for Web security research, providing new ideas for intelligent security detection.
topic Gated recurrent unit (GRU)
malicious URL detection
network attack
character-level embedding
convolutional neural network
neural network model
url https://ieeexplore.ieee.org/document/8629082/
work_keys_str_mv AT wenchuanyang detectingmaliciousurlsviaakeywordbasedconvolutionalgatedrecurrentunitneuralnetwork
AT wenzuo detectingmaliciousurlsviaakeywordbasedconvolutionalgatedrecurrentunitneuralnetwork
AT baojiangcui detectingmaliciousurlsviaakeywordbasedconvolutionalgatedrecurrentunitneuralnetwork
_version_ 1724191846900432896