Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection

With the rise of the data amount being collected and exchanged over networks, the threat of cyber-attacks has also increased significantly. Timely and accurate detection of any intrusion activity in networks has become a crucial task in order to safeguard data and other valuable assets. While manual...

Full description

Bibliographic Details
Main Authors: P. More, P. Mishra
Format: Article
Language:English
Published: D. G. Pylarinos 2020-09-01
Series:Engineering, Technology & Applied Science Research
Subjects:
Online Access:http://etasr.com/index.php/ETASR/article/view/3801
id doaj-6779b79cc5fd483bbf9efa21baeedccc
record_format Article
spelling doaj-6779b79cc5fd483bbf9efa21baeedccc2020-12-02T18:33:41ZengD. G. PylarinosEngineering, Technology & Applied Science Research2241-44871792-80362020-09-01105Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat DetectionP. More0P. Mishra1Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, IndiaDepartment of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, IndiaWith the rise of the data amount being collected and exchanged over networks, the threat of cyber-attacks has also increased significantly. Timely and accurate detection of any intrusion activity in networks has become a crucial task in order to safeguard data and other valuable assets. While manual moderation and programmed logic have been used for this purpose, the use of machine learning algorithms for superior pattern mapping is desired. The system logs in a network tend to include many parameters, and not all of them provide indications of an impending network threat. The selection of the right features is thus important for achieving better results. There is a need for accurate mapping of high dimension features to low dimension intermediate representations while retaining crucial information. In this paper, an approach for feature reduction and selection when working on the task of network threat detection is proposed. This approach modifies the traditional Principal Component Analysis (PCA) algorithm by working on its shortcomings and by minimizing the false detection rates. Specifically, work has been done upon the calculation of symmetric uncertainty and subsequent sorting of features. The performance of the proposed approach is evaluated on four standard-sized datasets that are collected using the Microsoft SYSMON real-time log collection tool. The proposed method is found to be better than the standard PCA and FAST methods for data reduction. The proposed approach makes a strong case as a dimensionality reduction and feature selection technique for minimizing false detection rates when operating on real-time data. http://etasr.com/index.php/ETASR/article/view/3801principal component analysisfast clusteringdimensionality reductionmachine learningnetwork security
collection DOAJ
language English
format Article
sources DOAJ
author P. More
P. Mishra
spellingShingle P. More
P. Mishra
Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
Engineering, Technology & Applied Science Research
principal component analysis
fast clustering
dimensionality reduction
machine learning
network security
author_facet P. More
P. Mishra
author_sort P. More
title Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
title_short Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
title_full Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
title_fullStr Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
title_full_unstemmed Enhanced-PCA based Dimensionality Reduction and Feature Selection for Real-Time Network Threat Detection
title_sort enhanced-pca based dimensionality reduction and feature selection for real-time network threat detection
publisher D. G. Pylarinos
series Engineering, Technology & Applied Science Research
issn 2241-4487
1792-8036
publishDate 2020-09-01
description With the rise of the data amount being collected and exchanged over networks, the threat of cyber-attacks has also increased significantly. Timely and accurate detection of any intrusion activity in networks has become a crucial task in order to safeguard data and other valuable assets. While manual moderation and programmed logic have been used for this purpose, the use of machine learning algorithms for superior pattern mapping is desired. The system logs in a network tend to include many parameters, and not all of them provide indications of an impending network threat. The selection of the right features is thus important for achieving better results. There is a need for accurate mapping of high dimension features to low dimension intermediate representations while retaining crucial information. In this paper, an approach for feature reduction and selection when working on the task of network threat detection is proposed. This approach modifies the traditional Principal Component Analysis (PCA) algorithm by working on its shortcomings and by minimizing the false detection rates. Specifically, work has been done upon the calculation of symmetric uncertainty and subsequent sorting of features. The performance of the proposed approach is evaluated on four standard-sized datasets that are collected using the Microsoft SYSMON real-time log collection tool. The proposed method is found to be better than the standard PCA and FAST methods for data reduction. The proposed approach makes a strong case as a dimensionality reduction and feature selection technique for minimizing false detection rates when operating on real-time data.
topic principal component analysis
fast clustering
dimensionality reduction
machine learning
network security
url http://etasr.com/index.php/ETASR/article/view/3801
work_keys_str_mv AT pmore enhancedpcabaseddimensionalityreductionandfeatureselectionforrealtimenetworkthreatdetection
AT pmishra enhancedpcabaseddimensionalityreductionandfeatureselectionforrealtimenetworkthreatdetection
_version_ 1724403836653666304